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GENE EXPRESSION PROFILING IN BIOPSIED TUMOR TISSUES 

Background of the Invention 

Cross-Reference 

5 This application claims the benefit tinder 35 U.S.C. 1 19(h) of provisional applications 

serial nos. 60/412,049, filed September 18, 2002 and 60/364,890, filed March 13, 2002, the 
entire disclosures which are herby incorporated by reference. 

Field of the Invention 

10 The present invention relates to gene expression profiling in biopsied tumor tissues. In 

particular, the present invention concerns sensitive methods to measure mRNA levels in biopsied 
tumor tissues, including archived paraffin-embedded biopsy material. In addition, the invention 
provides a set of genes the expression of which is important in the diagnosis and treatment of 
breast cancer. 

15 Oncologists have a number of treatment options available to them, including different 

combinations of chemotherapeutic dmgs that are characterized as "standard of care," and a 
number of dmgs that do not carry a label claim for a particular cancer, but for which there is 
evidence of efficacy in that cancer* Best likelihood of good treatment outcome requires that 
patients be assigned to optimal available cancer treatment, and that this assignment be made as 

20 quickly as possible following diagnosis* 

Ciarrently, diagnostic tests used in clinical practice are single anaiyte, and therefore do not 
capture the potential value of knowing relationships between dozens of different markers. 
Moreover, diagnostic tests are firequently not quantitative, relying on iromunohistochemistry. 
This method often jdelds different results in different laboratories, in part because Ihe reagents 

25 are not standardized, and in part because the interpretations are subjective and cannot be easily 
quantified. KNTA-based tests have not often been used because of the problem of RNA 
degradation over time and the fact that it is difficult to obtain fresh tissue samples from patients 
for analysis* Fixed paraffin-embedded tissue is more readily available and methods have been 
established to detect RNA in fixed tissue. However, these methods typically do not allow for the 

30 study of large numbers of genes (DNA or RNA) from small amoimts of material. Thus, 
traditionally fixed tissue has been rarely used other than for immunohistochemistry detection of 
proteins. 

Recently, several groups have published studies concerning the classification of various 
cancer by micioarray gene expression analysis (see, e.g. Golub et al.. Science 286:531-537 
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(1999); Bhattacharjae et aL, Proc, Natl Acad, Sci. USA 98:13790-13795 (2001); Chen-Hsiang et 
al, Bioinformatics 17 (SuppL 1):S316-S322 (2001); Ramaswamy et aU Proc. Natl Acad Set 
USA 98:15149-15154 (2001)). Certain classifications of human breast cancers based on gene 
expression patterns have also been reported (Martin et al. Cancer Res, 60:2232-2238 (2000); 
5 West etaL, Proc, Natl Acad Scl USA 98:11462-11467 (2001); Sorlie et al, Proc, Natl Acad. 
Scl USA 98:10869-10874 (2001); Yan et al. Cancer Res. 61:8375-8380 (2001)). However, 
these studies mostly focus on improving and refining the already established classification of 
various types of cancer, including breast cancer, and generally do not provide new insights into 
the relationships of the differentially expressed genes, and do not link the findings to treatment 

10 strategies in order to improve the clinical outcome of cancer therapy. 

Although modem molecular biology and biochemistry have revealed more than 100 genes 
whose activities influence the behavior of tumor cells, state of their differentiation, and their 
sensitivity or resistance to certain therapeutic drugs, with a few exceptions, the status of these 
genes has not been exploited for the purpose of routinely making clinical decisions about dmg 

15 treatments. One notable exception is the use of estrogen receptor (ER) protein expression in 
breast carcinomas to select patients to treatment with anti-estrogen dmgs, such as tamoxifen. 
Another exceptional example is the use of ErbB2 (Her2) protein expression in breast carcinomas 
to select patients with the Her2 antagonist drag Herceptin® (Genentech, Inc., South San 
Francisco, CA). 

20 Despite recent advances, the challenge of cancer treatment remains to target specific 

treatment regimens to pathogenically distinct tumor types, and ultimately personalize tumor 
treatment in order to maximize outcome. Hence, a need exists for tests that simultaneously 
provide predictive information about patient responses to the variety of treatment options. This 
is particularly trae for breast cancer, the biology of which is poorly imderstood. It is clear that 

25 the classification of breast cancer into a few subgroups, such as ErbB2'^ subgroup, and subgroups 
characterized by low to absent gene expression of the estrogen receptor (ER) and a few additional 
transcriptional factors (Perou et al. Nature 406:747-752 (2000)) does not reflect the cellular and 
molecular heterogeneity of breast cancer, and does not allow the design of treatment strategies 
maximizing patient response. 

30 

Summary of the Invention 
The present invention provides (1) sensitive me&ods to measure mRNA levels in 
biopsied tumor tissue, (2) a set of approximately 190 genes, the expression of which is important 
in the diagnosis of breast cancer, and (3) the significance of abnormally low or high expression 

2 

SUBSTITUTE SHEET (RULE 26) 



wo 2003/078662 PCT/US2003/007713 

for the genes identified and included in the gene set, through activation or disruption of 
biochemical regulatory pathways that influence patient response to particular drugs used or 
potentially useful in the treatment of breast cancer. These results permit assessment of genomic 
evidence of the efficacy of more than a dozen relevant drugs. 

The present invention accommodates the use of archived paraffin-embedded biopsy 
material for assay of all markers in the set, and therefore is compatible with the most widely 
available type of biopsy material. The mvention presents an eflScient method for extraction of 
RNA from wax-embedded, fixed tissues, which reduces cost of mass production process for 
acquisition of this information without sacrificing quality of the analysis. In addition, the 
invention describes a novel highly effective method for amplifying mRNA copy number, which 
permits increased assay sensitivity and the ability to monitor expression of large numbers of 
different genes given the limited amounts of biopsy material. The invention also captures the 
predictive significance of relationships between expressions of certain markers in the breast 
cancer marker set. Finally, for each member of the gene set, the invention specifies the 
oligonucleotide sequences to be used in the test 

In one aspect, the invention concerns a method for predicting clinical outcome for a 
patient diagnosed with cancer, comprising 

determining the expression level of one or more genes, or their expression products, 
selected from the group consisting of p53BP2, cathepsin B, cathepsin/L, Ki67/MiBl, and 
thymidine kinase in a cancer tissue obtained from the patient, normalized against a control gene 
or genes, and compared to the amotmt found in a reference cancer tissue set, 

wherein a poor outcome is predicted if: 

(a) the expression level of p53BP2 is in the lower 10* percentile; or 

(b) the expression level of either cathepsm B or cathepsin L is in the upper 10* 
percentile; or 

(c) ttie expression level of any either Ki67/MiBl or thymidhie kinase is in the upper 
10* percentile. 

Poor clinical outcome can be measured, for example, in terms of shortened survival or 
increased risk of cancer recurrence, e.g, following surgical removal of the cancer. 

In another embodiment, the inventor concerns a method of predicting the likelihood of 
the recurrence of cancer, following treatment, in a cancer patient, comprising determining the 
expression level of p27, or its expression product, in a cancer tissue obtained from the patient, 
normalized against a control gene or genes, and compared to the amount found in a reference 
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cancer tissue set, wherein an expression level in the upper 10th percentile indicates decreased 
risk of recurrence following treatment. 

In another aspect, the invention concerns a method for classifying cancer comprising, 
determining the expression level of two or more genes selected from the group consisting of 
5 Bcl2, hepatocyte nuclear factor 3, ER, ErbB2, and Grb7, or their expression products, in a cancer 
tissue, normalized against a control gene or genes, and compared to the amount found hi a 
reference cancer tissue set, wherein (i) tumors expressing at least one of Bcl2, hepatocyte 
nuclear factor 3, and ER, or their expression products, above the mean expression level in the 
reference tissue set are classified as having a good prognosis for disease fi*ee and overall patient 
10 survival following treatment; and (ii) tumors expressing elevated levels of ErbB2 and Grb7, or 
their expression products, at levels ten-fold or more above the mean expression level in the 
reference tissue set are classified as having poor prognosis of disease free and overall patient 
survival following treatment. 

All types of cancer are included, such as, for example, breast cancer, colon cancer, lung 
15 cancer, prostate cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, 
ovarian cancer, liver cancer, bladder cancer, cancer of Ihe urinary tract, thyroid cancer, renal 
cancer, carcinoma, melanoma, and brain cancer. The foregoing methods are particularly suitable 
for prognosis/classification of breast cancer. 

In all previous aspects, in a specific embodiment, the expression level is determined using 
20 RNA obtained fi-om a formalin-fixed, paraffin-embedded tissue sample. While all techniques of 
gene expression profiling, as well as proteomics techniques, are suitable for use in performing 
the foregoing aspects of the invention, the gene expression levels are often determined by reverse 
transcription polymerase chain reaction (RT-PCR). 

If the source of the tissue is a formalin-fixed, paraffin embedded tissue sample, the RNA 
25 is often fragmented. 

The expression data can be finther subjected to multivariate analysis, for example using 
the Cox Proportional Hazards model. 

In a farther aspect, the invention concerns a method for the preparation of nucleic acid 
from a fixed, wax-embedded tissue specimen, comprising: 
30 (a) incubating a section of the fixed, wax-embedded tissue specimen at a temperature 

of about 56 °C to 70 ""C in a lysis buffer, in the presence of a protease, without prior dewaxing, to 
form a lysis solution; 

(b) cooling the lysis solution to a temperature vsiiere the wax solidifies; and 

(c) isolating the nucleic acid from the lysis solution. 
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The lysis buffer may comprise urea, such as 4M urea. 

In a particular embodiment, incubation in step (a) of the foregoing method is performed at 
about 65°C. 

In another particular embodimentj, the protease used in the foregoing method is proteinase 

5 K- 

In another embodiment, the cooling in step (b) is performed at room temperature. 
In a further embodiment, the nucleic acid is isolated after protein removal wjth 2,5 M 
NH4OAC, 

The nucleic acid can, for example, be total nucleic acid present in the fixed, wax- 
10 ^ embedded tissue specimen. 

In yet another embodiment, the total nucleic acid is isolated by precipitation from the 
lysis solution, following protein removal, with 2.5 M NH4OAC. The precipitation may, for 
example, be performed with isopropanol. 

The method described above may further comprise the step of removing DNA from the 
1 5 total nucleic acid, for example by DNAse treatment. 

The tissue specimen may, for example, be obtained from a tumor, and the RNA may be 
obtained from a microdissected portion of the tissue specimen enriched for tumor cells. 

All types of tumor are included, such as, without limitation, breast cancer, colon cancer, 
lung cancer, prostate cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical 
20 cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, 
renal cancer, carcinoma, melanoma, and brain cancer, in particular breast cancer.. 

The method described above may further comprise the step of subjecting the RNA to 
gene expression profiling. Thus, the gene expression profile may be completed for a set of genes 
comprising at least two of the genes listed in Table 1 . 
25 Although all methods of gene expression profiling are contemplated, in a particular 

embodiment, gene expression profiling is performed by RT-PCR which may be preceded by an 
amplification step. 

In another aspect, the invention concerns a method for preparing fragmented RNA for 
gene expression analysis, comprising the steps of: 
30 (a) mixing the RNA with at least one gene-specific, single-stranded DNA scaffold 

under conditions such that fragments of the RNA complementary to the DNA scaffold hybridize 
with the DNA scaffold; 

(b) extendmg the hybridized RNA fragments with a DNA polymerase to form a 
" DNA-DNA duplex; and 
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(c) removing the DNA scaffold from the duplex. 

In a specific embodiment, in step (b) of this method, the RNA may be mixed witihi a 
mixture of single-stranded DNA templates specific for each gene of interest. 

The method can further comprise the step of heat-denaturing and reannealing the 
duplexed DNA to the DNA scaffold, with or without additional overlapping scaffolds, and 
fiirther extending the duplexed sense strand with DNA polymerase prior to removal of the 
scaffold in step (c). 

The DNA templates may be, but do not need to be, fully complementary to the gene of 
interest. 

In a particular embodiment, at least one of the DNA templates is complementary to a 
specific segment of the gene of interest. 

In another embodunent, the DNA templates include sequences complementary to 
polymorphic variants of the same gene. 

The DNA template may include one or more dUTP or rNTP sites. In this case, iin step (c) 
the DNA template may be removed by fiiagmenting the DNA template present in the DNA-DNA 
duplex formed in step (b) at the dUTP or rNTP sites. 

In an important embodiment, the RNA is extracted from fixed, wax-embedded tissue 
specimens, and purified sufficiently to act as a substrate in an enzyme assay. The RNA 
purification may, but does not need to, include an oligo-dT based step. . 

In a further aspect, the invention concerns a method for amplifying RNA Segments in a 
sample comprising fragmented RNA representing at least one gene of interest, comprising the 
steps of: 

(a) contacting the sample with a pool of single-stranded DNA scaffolds comprising 
an RNA polymerase promoter at the 5' end under conditions such that the RNA fragments 
complementary to the DNA scaffolds hybridize with the DNA scaffolds; 

(b) extending the hybridized RNA fragments with a DNA polymerase along the DNA 
scaffolds to form DNA-DNA duplexes; 

(c) amplifying the gene or genes of interest by in vitro transcription; and 

(d) removing the DNA scaffolds from the duplexes. 

An exemplary promoter is the T7 KNA polymerase promoter, while an exemplary DNA 

polymerase is DNA polymerase I. 

In step (d) the DNA scaffolds may be removed, for example, by treatment with DNase L 
In a further embodiment, the pool of single-stranded DNA scaffolds comprises partial or 

complete gene sequences of interest, such as a library of cDNA clones. 

6 
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In a specific embodiment, the sample represents a whole genome or a fraction thereof. In 
a preferred embodiment^ the genome is the human genome. 

In another aspect, the invention concerns a method of preparing a personalized genomics 
profile for a patient, comprising the steps of: 
5 (a) subjecting RNA extracted fi-om a tissue obtained from the patient to gene 

expression analysis; 

(b) determining the expression level in such tissue of at least two genes selected from 
the gene set listed in Table 1, wherein the expression level is normalized against a control gene 
or genes, and is compared to the amount found in a cancer tissue reference set; 
10 (c) and creating a report summarizit^ the data obtained by the gene expression 

analysis. 

The tissue obtained fi-om the patient may, but does not have to, comprise cancer cells. 
Just as before, the cancer can, for example, be breast cancer, colon cancer, limg cancer, prostate 
cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, 

15 liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, 
melanoma, or brain cancer, breast cancer being particularly preferred. 

In a particular embodiment, the RNA is obtained from a microdissected portion of breast 
cancer tissue enriched for cancer cells. The control gene set may, for example, comprise S-actin, 
and ribosomal protein LPO. 

20 The report prepared for the use of the patient or the patient's physician, may include the 

identification of at least one drug potentially beneficial in the treatment of the patient. 

Step (b) of the foregoing method may comprise the step of determining the expression 
level of a gene specifically infiuencing cellular sensitivity to a drug, where the gene can, for 
example, be selected from the group consisting of aldehyde dehydrogenase lAl, aldehyde 

25 dehydrogenase 1A3, amphiregulin, ARG, BRK, BCRP, CD9, CD31, CD82/KAI-1, COX2, c-abl, 
c-kit, c-kit L, CYPIBI, CYP2C9, DHFR, dihydropyrimidine dehydrogenase, EGF, epiregulin, 
ER-alpha, ErbB-1, ErbB-2, ErbB-3, ErbB-4, ER-beta, famesyl pyrophosphate synthetase, 
ganama-GCS (glutamyl cysteine synthetase), GATA3, geranyl geranyl pyrophosphate synthetase, 
Grb7, GST^alpha, GST-pi, HB-EGF, hsp 27, human chorionic gonadotropin/CGA, IGF-1, IGF-2, 

30 IGFIR, KDR, LIVl, Lung Resistance Protein/MVP, Lotl, MDR-1, microsomal epoxide 
hydrolase, MMP9, MRPl, MRP2, MRP3, MRP4, PAIl, PDGF-A, PDGF^B, PDGF-C, PDGF-D, 
PGDFR-alpha, PDGFR-beta, PLAGa (pleiomorphic adenoma 1), PREP prolyl endopeptidase, 
progesterone receptor, pS2/trefoil factor 1, PTEN, PTBlb, RAR-alpha, RAR-beta2, Reduced 
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Folate Carrier, SXR, TGF-alpha, thymidine phosphorylase, thymidine synthase, topoisomerase 
Il-alpha, topoisomerase Il-beta, VEGF, XIST, and YB-1 . 

In another embodiment^ step (b) of the foregoing process includes determining the 
expression level of multidrug resistance factors, such as, for example, gamma-glutamyl-cysteine 
5 synthetase (GCS), GST-a, GST-tc, MDR-1, MRPl-4, breast cancer resistance protein (BCRP), 
lung cancer resistance protein (MVP), SXR, or YB-1 . 

In another embodiment, step (b) of the foregoing process comprises determination of the 
expression level of eukaiyotic translation initiation factor 4E (EIF4E). 

In yet another embodiment, step (b) of the foregoing process comprises determination of 
10 the expression level of a DNA repair en2yme. 

In a further embodiment, step (b) of the foregoing process comprises determination of the 
expression level of a cell cycle regulator, such as, for example, c-MYC, c-Src, Cyclin Dl, Ha- 
Ras, mdm2. pl4ARF, p21WAFl/CI, pl6XNK4a/pl4, p23, p27, p53, PI3K, PKC-epsilon, or 
PKC-delta. 

15 In a still further embodiment, step (b) of the foregoing process comprises determination of 

the expression level of a tumor suppressor or a related protein, such as, for example, APC or E- 
cadherin. 

In another embodiment, step (b) of the foregoing method comprises determination of the 
expression level of a gene regulating apoptosis, such as, for example, p53, BC12, Bcl-xl, Bak, 
20 Bax, and related factors, NFk-B, CIAPl, CIAP2, survivin, and related factors, p53BPl/ASPPl, 
orp53BP2/ASPP2. 

In yet another embodiment, step (b) of the foregoing process comprises determination of 
the expression level of a factor that controls cell invasion or angiogenesis, such as, for example, 
uPA, PAIl , cathepsin B, C, and L, scatter factor (HGF), c-met, KDR, VEGF, or CD3 1 . 
25 In a different embodiment, step (b) of the foregoing method comprises detemiination of 

the expression level of a marker for immune or inflammatory cells or processes, such as, for 
example, Ig light chain X, CD18, CD3, CD68. Fas(CD95), or Fas Ligand. 

In a further embodiment, step (b) of the foregoing process comprises determination of the 
expression level of a cell proliferation marker, such as, for example, Ki67/MiBl, PCNA, Pinl, or 
30 thymidine kinase. 

In a still further embodiment, step (b) of the foregoing process comprises determination of 
the expression level of a growth factor or growth factor receptor., such as, for example, IGFl, 
IGF2, IGFBP3, IGFIR, FGF2, CSF-I, CSF-lR/fins, SCF-1, JL6 or IL8. 
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In another embodiment, step (b) of the foregoing process comprises detemiination of the 
expression level of a gene marker that defines a subclass of breast cancer, where the gene marker 
can, for example, be GROl oncogene alpha, Grb7, cytokeratins 5 and 17, retinol binding protein 
4, hepatocyte nuclear factor 3, integrin subimit alpha 7, or lipoprotein lipase. 
5 In a still further aspect, the invention concerns a method for predicting the response of a 

patient diagnosed with breast cancer to 5-fluorouracil (5-FU) or an analog thereof, comprising 
the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis; 

10 (b) determining the expression level in the tissue of thymidylate synthase mRNA, 

wherein the expression level is normalized against a control gene or genes, and is compared to 
the amount fotmd in a reference breast cancer tissue set; and 

(c) predicting patient response based on the normalized thymidylate synthase niRNA 
level. , 

15 Step (d) of the foregoing method can ftather comprise determining the expression level of 

dihydropyrimidine phosphorylase. 

In another embodiment, step (b) of the method can further comprise determining the. 
expression level of thymidine phosphorylase. 

In yet another embodiment, a positive response to 5-FU or an analog thereof is predicted 

20 if: (i) normalized thymidylate synthase mRNA level determined in step (b) is at or below the 15* 
percentile; or (ii) the sum of normalized expression levels of thymidylate synthase and 
dihydropyrimidine phosphorylase determined in step (b) is at or below the 25^ percentile; or (iii) 
the sum of normalized expression levels of thymidylate synthase, dihydropyrimidine 
phosphorylase, plus thymidine phosphorylase determined in step (b) is at or below the 20* 

25 percentile.. 

In a further embodiment, in step (b) of the foregoing method the expression level of c-myc 
and wild-type p53 is determined. In this case, a positive response to 5-FU or an analog thereof is 
predicted, if the normalized expression level of c-myc relative to the normalized expression level 
of wild-type p53 is in the upper 15* percentile. 
30 In a still further embodiment, in step (b) of the foregoing method, expression level of 

NFkB and cIAP2 is determined. In this particular embodiment, resistance to 5-FU or an analog 
thereof is typically predicted if the normalized expression level of NFkB and cIAP2 is at or 
above the 1 0* percentile. 
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In anoflier aspect, the invention concerns a method for predicting the response of a patient 
diagnosed with breast cancer to methotrexate or an analog thereof, comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 

5 or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting decreased patient sensitivity to methotrexate or analog if (i) DHFR 
levels are more than tenfold higher than the average expression level of DHFR in the control 
gene set, or (ii) the normalized expression levels of members of the reduced folate carrier (RFC) 
family are below the 10* percentile. 

10 In yet another aspect, the invention concerns a method for predicting the response of a 

patient diagnosed with breast cancer to an anthracyciline or an analog thereof, comprising the 
steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 

1 5 or genes, and compared to the amount foimd in a reference breast cancer tissue set; and 

(b) predicting patient resistance or decreased sensitivity to the anthracycline or analog 
if (i) the normalized expression level of topoisomerase Ua is below the 10*^ percentile, or (ii) the 
normalized expression level of topoisomerase Hp is below the 10*** percentile, or (iii) the 
combined normalized topoisomerase Ha or Iip expression levels are below the 10*** percentile. 

20 hi a different aspect, the invention concerns a method for predicting the response of a 

patient diagnosed with breast cancer to a docetaxol, comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 
or genes, and compared to the amoimt foimd in a reference breast cancer tissue set; and 
25 (b) predicting reduced sensitivity to docetaxol if the normalized expression level of 

CYPIBI is in the upper 10* percentile. 

Hie invention fiarther concerns a method for predicting the response of a patient 
diagnosed with breast cancer to cyclophosphamide or an analog thereof, comprising 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
30 gene expression analysis, wherein gene expression levels are normalized against a control gene 

or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting reduced sensitivity to the cyclophosphamide or analog if the sum of the 
expression levels of aldehyde dehydrogenase lAl and 1 A3 is more than tenfold higher than the 
average of their combined expression levels in the reference tissue set. 

10 
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In a further aspect, the invention concerns a method for predicting the response of a 
patient diagnosed with breast cancer to anti-estrogen therapy, comprising ' 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 

5 or genes, and compared to the amount found in a reference breast cancer tissue set that contains 
both specimens negative for and positive for estrogen receptor-a (ERa) and progesterone 
receptor-a (PRa); and 

(b) predicting patient response based upon the normalized expression levels of ERa 
or PRa, and at least one of microsomal epoxide hydrolase, pS2/trefoil factor 1, GATA3 and 

10 human chorionic gonadotropin. 

In a specific embodiment, lack of response or decreased responsiveness is predicted if (i) 

the normalized expression level of microsomal epoxide hydrolase is in the upper 10^ percentile; 

or (ii) the normalized expression level of pS2/trefoil factor 1, or GATA3 or human chorionic. 

gonaostropin is at or below the corresponding average expression level in said breast cancer 
15 tissue set, regardless of the expression level of ERa or PRa in the breast cancer tissue obtained 

from the patient. 

In another aspect, the invention concerns a method for predicting the response of a patient 
diagnosed with breast cancer to a taxane, comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
20 gene expression analysis, wherein gene expression levels are normalized against a control gene 

or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting reduced sensitivity to taxane if (i) no or minimal XIST expression is 
detected; or (ii) the normalized expression level of GST-tc or propyl endopeptidase (PREP) is in 
the upper 10* percentile; or (iii) the normalized expression level of PLAOl is in the upper 10* 

25 percentile. 

The invention also concerns a method for predicting the response of a patient diagnosed 
with breast cancer to cisplatin or an analog thereof, comprising the steps of: . 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 

30 or genes, and compared to the amoimt found in a reference breast cancer tissue set; and 

(b) predicting resistance or reduced sensitivity if the noraialized expression level of 
ERCCl is in the upper 10* percentile. 

The invention further concerns a method for predicting the response of a patient 

diagnosed with breast cancer to an ErbB2 or EGFR ants^onist, comprising the steps of: 
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(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 
or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting patient response based on the normalized expression levels of at least 
5 one of Grb7, IGFIR, IGFl and IGF2. 

In particular embodiment, a positive response is predicted if the normalized expression 
level of Grb7 is in the upper 10^ percentile, and the expression of IGFl R, IGFl and IGF2 is not 
elevated above the 90^ percentile. . 

In a further particular embodiment, a decreased responsiveness is predicted if the 
10 expression level of at least one of IGFIR, IGFl and IGF2 is elevated. 

In another aspect, the invaition concerns a method for predicting the response of a patient 
diagnosed with breast cancer to a bis-phosphonate dmg, comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 

15 or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting a positive response if the breast cancer tissue obtained from the patient 
expresses mutant Ha-Ras and additionally expresses famesyl pyrophosphate synthetase or 
geranyl pyrophosphone synthetase at a normalized expression level at or above the 90* 
percentile. 

20 In yet another aspect, the invention concerns a method for predicting the response of a 

patient diagnosed with breast cancer to treatment wiflb a cyclooxygenase 2 inhibitor, comprising 
the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 

25 or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting a positive response if the normalized expression level of COX2 in the 
breast csancer tissue obtained from the patient is at or above the 90* percentile* 

The invention further concerns a method for predicting the response of a patient 
diagnosed with breast cancer to an EGF receptor (EGFR) antagonist , comprising the steps of: 
30 ,(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 

gene expression analysis, wherein gene expression levels are normalized against a control gene 
or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting a positive response to an EGFR antagonist, if (i) the normalized 
expression level of EGFR is at or above the 10* percentile, and (ii) the normalized expression 
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level of at least one of epiregulin, TGF-<x, amphiregulin, ErbB3, BRK, CD9, MMP9, CD82, and 
Lotl is above the 90* percentile. 

In another aspect, the invention concerns a method for monitoring the response of a 
patient diagnosed with breast cancer to treatment with an EGFR antagonist, comprising 
5 monitoring the expression level of a gene selected from the group consisting of epiregulin, TGF- 
a, amphiregulin, ErbB3, BRK, CD9, MMP9, CD82, and Lotl m the patient during treatment, 
wherein reduction in the expression level is indicative of positive response to such treatment. 

In yet another aspect, the invention concerns a method for predicting the response of a 
patient diagnosed with breast cancer to a drug targeting a tyrosine kinase selected from the group 
1 0 consisting of abl, c-kit, PDGFR-a, PDGFR-p and ARG , comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 
or genes, and compared to the amoxmt found in a reference breast cancer tissue set; 

(b) determining the normalized expression level of a tyrosine kinase selected from the 
15 group consisting of abl, c-kit, PDGFR-a, PDGFR-p and ARG, and ttie cognate ligand of the 

tyrosine kinase, and if the normalized expression level of the tyrosine kinase is in the upper 10^ 
percentile, 

(c) determining whether the sequence of the tyrosine kinase contains any mutation, 
wherein a positive response is predicted if (i) the normalized expression level of the 

20 tyrosine kinase is in the upper 10^ percentile, (ii) the sequence of the tyrosine kinase contains an 
activating mutation, or (iii) the normalized expression level of the tyrosine kinase is normal and 
the expression level of the ligand is in the upper 10^ percentile. 

Another aspect of the invention is a method for predicting the response of a patient 
diagnosed with breast cancer to treatment with an anti-angiogenic drug, comprising the steps of: 

25 (a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 

gene expression analysis, \s^erein gene expression levels are normalized against a control gene 
or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) predicting a positive response if (i) the normalized expression level of VEGF is in 
the upper 10*^ percentile and (ii) the normahzed expression level of KDR or CD31 is in the upper 

30 20* percentile. 

A further aspect of the invention is a method for predicting the likelihood that a patient 
diagnosed with breast cancer develops resistance to a drug interacting with the MRP-1 gene 
coding for the multidrug resistance protein P-glycoprotein, comprising the steps of: 
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(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis to determine the expression level of PTPlb, wherein .the expression 
level is normalized against a control gene or genes, and compared to the amount found in a 
reference breast cancer tissue set; and 

5 (b) concluding that the patient is likely to develop resistance to said dmg if the 

normalized expression level of the MRP-1 gene is above the 90^ percentile. 

The invention further relates to a method for predictuig the likelihood that a patient 
di^nosed with breast cancer develops resistance to a chemotherapeutic drug or toxin used in 
cancer treatment, comprising the steps of: 
10 (a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 

gene expression analysis, wherein gene expression levels are normalized against a control gene 
or genes, and compared to the amount found in a reference breast cancer tissue set; and 

(b) determining the normalized expression levels of at least one of the following 
genes: MDRl, SGTa, GSTtt, SXR, BCRP YB-1, and LRP/MVP, wherein the finding of a 

1 5 normalized expression level in the upper 4* percentile is an indication that the patient is likely to 

develop resistance to the drug. 

Also included herein is a method for measuring the translational efficiency of VEGF 

mRNA in a breast cancer tissue sample, comprising determining the expression le^vels of the 

VEGF and EIF4E mRNA in the sample, normalized against a control gene or genes, and 
20 compared to the amount found in a reference breast cancer tissue set, wherein a higher 

normalized EIF4E expression level for the same VEGF expression level is indicative of relatively 

higher translational efficiency for VEGF, 

In another aspect, the invention provides a method for predicting the response of a patient 

diagnosed with breast cancer to a VEGF antagonist, comprising determining the expression level 
25 of VEGF and EIF4E mRNA normalized against a control gene or genes, and compared to the 

amount found in a reference breast cancer tissue set, wherein a VEGF expression level above the 

90* percentile and an EIF4E expression level above the 50* percentile is a predictor of good 

patient response. 

The invention further provides a method for predicting the likelihood of the recurrence of 
30 breast cancer in a patient diagnosed with breast cancer, comprising determining the ratio of 
p53:p21 mRNA expression or p53:mdm2 mRNA expression in a breast cancer tissue obtained 
from the patient, normalized against a control gene or genes, and compared to the amount found 
in a reference breast cancer tissue set, wherein an above normal ratio is indicative of a higher risk 
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of recitrrence. Typically, a higher risk of recurrence is indicated if the ratio is in the upper 10**^ 
percentile. 

In yet another aspect, the invention concerns a method for predicting the likelihood of the 
recurrence of breast cancer in a breast cancer patient following surgery, comprising determining 
5 the expression level of cyclin Dl in a breast cancer tissue obtained from the patient, normalized 
against a control' gene or genes, and compared to the amount found in a reference breast cancer 
tissue set, wherein an expression level in the upper 10* percentile indicates increased risk of 
recurrence following surgery. In a particular embodiment of this method, the patient is subjected 
to adjuvant chemotherapy, if the expression level is in the upper 10* percentile. 

10 Another aspect of the invention is a method for predicting the likelihood of the recurrence 

of breast cancer in a breast cancer patient following surgery, comprising determining the 
expression level of APC or E-cadherin in a breast cancer tissue obtained from the patient, 
normalized against a control gene or genes, and compared to the amount found in a reference 
breast cancer tissue set, wherein an expression level in the upper 5*^ percentile indicates high risk 

15 of recurrence following surgery, and heightened risk of shortened survival* 

A further aspect of the invention is a method for predicting the response of a patient 
diagnosed with breast cancer to treatment with a proapoptotic dmg comprising determining the 
expression levels of BC12 and c-MYC in a breast cancer tissue obtained from the patient, 
normalized against a control gene or genes, and compared to the amount found in a reference 

20 breast cancer tissue set, wherein (i) a BC12 expression level in the upper 10* percentile in the 
absence of elevated expression of c-MYC indicates good response, and (ii) a good response is 
not indicated if the expression level c-MYC is elevated, regardless of the expression level of 
BC12. 

A still further aspect of the invention is a method for predicting treatment outcome for a 
25 patient diagnosed with breast cancer, comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis, wherein gene expression levels are normalized against a control gene 
or genes, and compared to the amount foimd in a reference breast cancer tissue set; and 

(b) determining the normalized expression levels of NFkB and at least one gene 
30 selected from the group consisting of cIAPl , cIAP2, XIAP, and Survivin, 

wherein a poor prognosis is indicated if the expression levels for NFkB and at least one 
of the genes selected from the group consisting of cIAPl, cIAP2, XIAP, and Survivin is in the 
upper 5* percentile. 
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The invention further concerns a method for predicting treatment outcome for a patient 
diagnosed with breast cancer, comprising determining the expression levels of p53BPl and 
p53BP2 in a breast cancer tissue obtained from the patient, normalized against a control gene or 
genes, and compared to the amount found in a reference breast cancer tissue set, wherein a poor 
5 outcome is predicted if the expression level of either p53BPl or p53BP2 is in the lower 10^** 
percentile. 

The invention additionally concerns a method for predicting treatment outcome for a 
patient diagnosed with breast cancer, comprising determining the expression levels of uPA and 
PAIl in a breast cancer tissue obtained from the patient, normalized against a control gene or 
10 genes, and compared to the amount found in a reference breast cancer tissue set, wherein (i) a 
poor outcome is predicted if the expression levels of uPA and PAIl are in the upper 20* 
percentile, and (ii) a decreased risk of recurrence is predicted if the expression levels of uPA and 
PAIl are not elevated above the mean observed in the breast cancer reference set. In a particular 
embodiment, poor outcome is measured in terms of shortened survival or increased risk of cancer 
15 recurrence following surgery. In another particular embodiment, uPA and PAIl are expressed at 
normal levels, and the patient is subjected to adjuvant chemotherapy following surgery. 

Another aspect of the invention is a method for predicting treatment outcome in a patient 
diagnosed with breast cancer, comprising determining the expression levels of cathepsin B and 
cathepsin L in a breast cancer tissue obtained from the patient, normalized against a control gene 
20 or genes, and compared to the amount found in a reference breast cancer tissue set, wherein a 
poor outcome is predicted if the expression level of either cathepsin B or cathepsin L is in the 
upper 10*** percentile. Just as before, poor treatment outcome may be measured, for example, in 
terms of shortened survival or increased risk of cancer recurrence. 

A further aspect of the invention is a method for devising the treatment of a patient 
25 diagnosed with breast cancer, comprising the steps of 

(a) determining the expression levels of scatter factor and c-met in a breast cancer 
tissue obtained from the patient, normalized against a control gene or genes, and compared to the 
amoimt foimd in a reference breast cancer tissue set, and 

(t>) . suggesting prompt aggressive chemotherapeutic treatment if the expression levels 
30 of scatter factor and c-met or the combination of both, are above the 90^ percentile. 

A still further aspect of the invention is a method for predicting treatment outcome for a 
patient diagnosed with breast cancer, comprising determining the expression levels of VEGF, 
CD3 1, and KDR in a breast cancer tissue obtained from the patient, normalized against a control 
gene or genes, and compared to the amount found in a reference breast cancer tissue set, wherein 
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a poor treatment outcome is predicted if the expression level of any of VEGF, CD3 1, and KDR is 
in the upper 10* percentile. 

Yet another aspect of the invention is a method for predicting treatment outcome for a 
patient diagnosed with breast cancer, comprising .determining the expression levels of 
5 Ki67/MiBl, PCNA, Pinl, and thymidine kinase in a breast cancer tissue obtained from the 
patient^ normalized against a control gene or genes, and compared to the amount found in a 
reference breast cancer tissue set, wherein a poor treatment outcome is predicted if the expression 
level of any of Ki67/MiBl, PCNA, Pinl, and thymidine kinase is in the upper 10* percentile. 

The invention further concerns a method for predicting treatment outcome for a patient 

10 diagnosed with breast cancer, comprising determining the expression level of soluble and full 
length CD95 in a breast cancer tissue obtained from the patient, , normalized against a control 
gene or genes, and compared to the amoxmt found in a reference breast cancer tissue set, wherein 
the presence of soluble CD95 correlates with poor patient survival. 

The invention also concerns a method for predicting treatment outcome for a patient 

15 diagnosed with breast cancer, comprising determining the expression levels of IGFl, IGFIR and 
IGFBP3 in a breast cancer tissue obtained from the patient, normalized against a control gene or 
genes, and compared to the amount found in a reference breast cancer tissue set, wherein a poor 
treatment outcome is predicted if the sum of the expression levels of IGFl, IGFIR and IGFBP3 
is in the upper 10* percentile. 

20 The invention additionally concerns a method for classifying breast cancer comprising, 

determining the expression level of two or more genes selected from the group consisting of 
Bcll2, hepatocyte nuclear factor 3, LIVl, ER, lipoprotein lipase, retinol binding protein 4, 
integrin a7, cytokeratin 5, cytokeratin 17, GRO oncogen, ErbB2 and Grb7, in a breast cancer 
tissue, normalized against a control gene or genes, and compared to the amount foimd in a 

25 reference breast cancer tissue set, wherein (i) tumors expressing at least one of Bell, hepatocyte 
nuclear factor 3, LIVl, and ER above the mean expression level in the reference tissue set are 
classified as having a good prognosis for disease free and overall patient survival following 
surgical removal; (ii) tumors characterized by elevated expression of at least one of lipoprotein 
lipase, retinol binding protein 4, integrin a7 compared to the reference tissue set are classified as 

30 having intermediate prognosis of disease free and overall patient survival following surgical 
removal; and (iii) tumors expressing either elevated levels of cytokeratins 5 and 17, and GRO 
oncogen at levels four-fold or greater above the mean expression level in the reference tissue set, 
or ErbB2 and Grb7 at levels ten-fold or more above the mean expression level in the reference 
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tissue set are classified as having poor prognosis of disease free and overall patient survival 
following surgical removal. 

Another aspect of the invention is a panel of two or more gene specific primers selected 
from the group consisting of the forward and reverse primers listed in Table 2. 
5 Yet another aspect of the invention is a method for reverse transcription of a fragmented 

RNA population in RT-PCR amplification, comprising using a multiplicity of gene specific 
primers as the reverse primers in the amplification reaction. In a particular embodiment, the 
method uses between two and about 40,000 gene specific primers in the same amplification 
reaction. In another embodiment, the gene specific primers are about 1 8 to 24 bases, such as 

10 about 20 bases in length. In another embodiment, the Tm of the primers is about 58-60 °C- The 
primers can, for example, be selected from the group consisting of the forward and reverse 
primers listed in Table 2. 

The invention also concerns a method of reverse transcriptase driven first strand cDNA 
synthesis, comprising using a gene specific primer of about 18 to 24 bases in length and having a 

15 Tm optimum between about 58 °C and about 60 ®C. In a particular embodiment, the first strand 
cDNA synthesis is followed by PGR DNA amplification, and the primer serves as the reverse 
primer that drives the PGR amplification. In another embodiment, the method uses a plxirality of 
gene specific primers in the same first strand cDNA synthesis reaction mixture. The number of 
the gene specific primers can, for example, be between 2 and about 40,000. 

20 In a different aspect, the invention concerns a method of predicting the likelihood of 

long-term survival of a breast cancer patient without the recurrence of breast cancer, following 
surgical removal of the primary tumor, comprising determining the expression level of one or 
more prognostic RNA transcripts or their product in a breast cancer tissue sample obtained from 
said patient, normalized against the expression level of all RNA transcripts or their products in 

25 said breast cancer tissue sample, or of a reference set of RNA transcripts or their products, 
wherein the prognostic transcript is the transcript of one or more genes selected from the group 
consisting of: FOXMl, PRAME, Bcl2, STK15, GEGPU Ki-67, GSTMl, CA9, PR, BBG3, 
NMEU SURV, GATA3, TFRC, YB-1, DPYD, GSTM3, RPS6KB1, Src, Chkl, IDl, EstRl, p27, 
CCNBl, XIAP, Chk2, CDC25B, IGFIR, AK055699, P13KC2A, TGFB3, BAGIl, CYP3A4, 

30 EpCAM, VEGFC, pS2, hENTl, WISPl, HNF3A, NFKBp65, BRCA2, EGFR, TKl, VDR, 
Gontig51037, pENTl, EPHXl, IFIA, DIABLO, CDHl, HIFla, IGFBP3, CTSB, and Her2, 
wherein overexpression of one or more of FOXMl, PRAME, STK15, Ki-67, CA9, NMEl, 
SURV, TFRC, YB-1, RPS6KB1, Src, Chkl, CChJBl, Chk2, CDC25B, CYP3A4, EpCAM, 
VEGFC, hENTl, BRCA2, EGFR, TKl, VDR, EPHXU DFIA, Contig51037, CDHU HIFla, 
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IGFBP3, CTSB, Her2, and pENTl indicates a decreased likelihood of long-term survival without 
breast cancer recurrence, and the overexpression of one or more of Bcl2, CEGP1,.CjSTM1, PR, 
BBC3, GATA3, DPYD, GSTM3, IDl, EstRl, p27, XIAP, IGFIR, AK055699, P13KC2A, 
TGFB3, BAGIl, pS2, WISPl, HNF3A, NFKBp65, and DIABLO indicates an increased 
5 likelihood of long-term survival without breast cancer recurrence. 

In a particular embodiment of this method, the expression level of at least 2, preferably at 
least 5, more preferably at least 10, most preferably at least 15 prognostic transcipts or their 
expression products is determined. 

When the breast cancer is invasive breast carcinoma, including both estrogen receptor 
' 10 (ER) overexpressing (ER positive) and ER negative tumors, the analysis includes determination 
of the expression levels of tiie transcripts of at least two of the following genes, or their 
expression products: FOXMl, PRAME, Bcl2, STK15, CEGPl, Ki-67, GSTMl, PR, BBC3, 
NMEl, SURV, GATA3, TFRC, YB-1, DPYD, Src, CA9, Contig51037, RPS6K1 and Her2. 

When the breast cancer is ER positive invasive breast carcinoma, the analysis includes 
15 dtermination of the expression levels of the transcripts of at least two of the following genes, or 
their expression products: PRAME, Bcl2, FOXMl, DIABLO, EPHXl, HIFl A, VEGFC, Ki-67, 
IGFIR, VDR, NMEl, GSTM3, Contig51037, CDC25B, CTSB, p27, CDHl, and IGFBP3. 

Just as before, it is preferred to determine the expression levels of at least 5, more 
preferably at least 10, most preferably at least 15 genes, or their respective expression products. 
20 In a particular embodiment, the expression level of one or more prognostic RNA 

transcripts is determined, where RNA may, for example, be obtained from a fixed, wax- 
embedded breast cancer tissue specimen of the patient. The isolation of RNA can, for example, 
be carried out following any of the procedures described above or throughout the application, or 
by any other method known in the art. 
25 In yet another aspect, the invention concerns an array comprising polynucleotides 

hybridizmg to the following genes: FOXMl, PRAME, Bcl2, STK15, CEGPl, Ki-67, GSTMl, 
PR, BBC3, NMEl, SURV, GATA3, TFRC, YB-l, DPYD, CA9, Contig51037, RPS6K1 and 
Her2, immobilized on a solid surface. 

In a particular embodiment, the array comprosies polynucleotides hybridizing to the 
30 following genes: FOXMl, PRAME, Bcl2, STK15, CEGPl, Ki-67, GSTMl, CA9, PR, BBC3, 
NMEl, SURV, GATA3, TFRC, YB-1, DPYD, GSTM3, RPS6KB1, Src, Chkl, IDl, EstRl, p27, 
CCNBl, XIAP, Chk2, CDC25B, IGFIR, AK055699, P13KC2A, TGFB3, BAGIl, CYP3A4, 
EpCAM, VEGFC, pS2, hENTl, WISPl, HNF3A, NFKBp65, BRCA2, EGFR, TKl, VDR, 
Contig51037, pENTl, EPHXl, IFIA, CDHl, HIFla, IGFBP3, CTSB, Her2 and DIABLO. 
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In a further aspect, the invention concerns a method of predicting the likelihood of long- 
term survival of a patient diagnosed with invasive breast cancer, without the recurrence of breast 
cancer, following surgical removal of the primary tumor, comprising the steps of: 

(1) determining the expression levels of the RNA transcripts or the expression 
5 products of genes of a gene set selected from the group consisting of 

(a) Bcl2, cyclinGl, NFKBp65, NMEl, EPHXl, TOP2B, DR5, TERC, Src, DIABLO; 

(b) Ki67,XIAP,hENTl,TS,CD9,p27, cyclinGl, pS2,NFKBp65,CYP3A4; ' 

(c) GSTMl, XIAP, Ki67, TS, cyclinGl, p27, CYP3A4, pS2, NFKBp65, 
ErbB3; 

10 (d) PR, NMEl, XIAP, upa, cyclinGl , Contig51037, TERC, EPHXl, 
ALDH1A3, CTSL; 

(e) CA9, NMEl, TERC, cyclinGl, EPHXl, DPYD, Src, TOP2B, NFKBp65, 
VEGFC; 

(f) TFRC, XIAP, Ki67, TS, cyclinGl , p27, CYP3A4, pS2, ErbB3, NFKBp65; 
15 (g) Bcl2,PRAME, cyclinGl, FOXMl,NFKBp65,TS, XIAP, Ki67, CYP3A4, 

p27; 

(h) FOXMl, cyclmGl, XIAP, Contig51037, PRAME, TS, Ki67, PDGFRa, 
p27,NFKBp65; 

(i) PRAME, FOXMl, cycUnGl, XIAP, Contig51037, TS, Ki6, PDGFRa, p27, 
20 NFKBp65; 

0) Ki67, XIAP, PRAME, hENTl, contig51037, TS, CD9, p27, ErbB3, cyclinGl; 
(k) STK15, XIAP, PRAIVIE, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1; 
(1) GSTMl, XIAP, PRAME, p27, Contig51037, ErbB3, GSTp, EREG, IDl, 
PLAUR; 

25 (m) PR, PRAME, NMEl, XL\P, PLAUR, cyclinGl, Contig51037, TERC, EPHXl, 
DR5; 

(n) CA9, FOXMl , cyclinGl , XIAP, TS, Ki67, NFKBp65, C YP3 A4, GSTM3, 
p27; 

(o) TFRC, XIAP, PRAME, p27, Contig51037, ErbB3, DPYD, TERC, NMEl, 
30 VEGFC; and 

(p) CEGPl, PRAME, hENTl, XL\P,Contig51037,ErbB3, DPYD, >JFKBp65, 
IDl, TS 
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in a breast cancer tissue sample obtained from said patient, normalized against the expression 
levels of all RNA transcripts or tiieir products in said breast cancer tissue sample, or of a 
reference set of RNA transcripts or their products; 

(2) subjecting the data obtained in step (a) to statistical analysis; and 

(3) determining whether the likelihood of said long-term survival has increased or 
decreased. 

In a still further aspect, the invention concerns a method of predicting the likelihood of 
long-term survival of a patient diagnosed with estrogen receptor (ER)-positive invasive breast 
cancer, without the recurrence of breast cancer, following surgical removal of the primary tumor, 
comprisii^ the steps of: 

(1) determining the expression levels of the RNA transcripts or the expression 
products of genes of a gene set selected from the group consisting of 

(a) FRAME, p27, IGFBP2, HIFIA, TIMP2, ILT2, CYP3A4, IDl, EstRl, 
DIABLO; 

(b) Contig51037. EPHXl, Ki67, TIMP2, cyclinGl, DPYD, CYP3A4, TP, AIBl, 
CYP2C8; 

(c) Bcl2, hENTl, FOXMl, Contig51037, cyclinGl, Contig46653, PTEN, 
CYP3A4, TIMP2, AREG; 

(d) HIFIA, FRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, IDl, EstRl , 
DIABLO; 

(e) IGFIR, FRAME, EPHXl, Contig51037, cyclinGl, Bcl2, NMEl, PTEN, TBP, 
TIMP2; 

(f) FOXMl, Contig51037, VEGFC, TBP, HIFIA, DPYD, RAD51C, DCR3, 
cyclinGl, BAGl; 

(g) EPHXl, Contig51037, Ki67, TIMP2, cyclinGl, DPYD, CYP3A4, TP, AIBl, 
CYP2C8; 

(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pint; 

(i) CDC25B, Contig51037, hENTl, Bcl2, HLAG, TERC, NMEl, upa, IDl, CYP; 
(j) VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl ; 

(k) CTSB, FRAME, p27, IGFBP2, EPHXl , CTSL, BAD, DR5, DCR3, XL^; 
(1) DIABLO, Kj67, hENTl, TIMP2, IDl, p27, KRT19, JGFBP2, TS, PDGFB; 
(m) p27, FRAME, IGFBP2, HIFIA, TIMP2, ILT2, CYP3A4, IDl, EstRl , 
DL\BLO; 
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(n) CDHl ; FRAME, VEGFC; HIFl A; DPYD, TIMP2, CYP3 A4, EstRl , RBP4, 
p27; 

(o) IGFBP3, FRAME, p27, Bcl2, XIAP, EstRl , Ki67, TS, Src, VEGF; 

(p) GSTM3, FRAME, p27, IGFBP3, XIAF, FGF2, hENTl , FTEN, EstRl , AFC; 

(q) hENTl , Bcl2, FOXMl , Contig51037, CyclinGl, Contig46653, FTEN, 

CYP3A4, TIMF2, AREG; 
(r) STK15, VEGFC, FRAME, p27, GCLC, hENTl, IDl, TIMP2, EstRl, MCFl; 
(s) NMEl , PRAM, p27, IGFBP3, XIAP, PTEN, hENTl . Bcl2, C YP3A4, HLAG; 
(t) VDR, Bcl2, p27, hENTl, p53, PDKC2A, EIF4E, TFRC, MCM3, IDl ; 
(u) EIF4E, Coiitig51037, EPHXl, cyclinGl, Bcl2, DR5, TBP, PTEN, NMEl, 

HER2; 

(v) CCNBl, FRAME, VEGFC, HIFl A, hENTl , GCLC, TIMP2, IDl , p27, upa; 
(w) IDl , FRAME, DIABLO, hENTl, p27, FDGFRa, NMEl , BINl , BRCAl , TP; 
(x). FBX05, FRAME, IGFBF3, p27, GSTM3, hENTl, XIAP, FGF2, TS, FTEN; 
(y) GUS, IHAIA, VEGFC, GSTM3, DPYD, hENTl, EBX05, CA9, CYP, 
KRT18; and 

(2) Bclx, Bcl2, hENTl, ContigS 1037, HLAG, CD9, IDl, BRCAl, BBSfl, 
HBEGF; 

(2) subjecting the data obtained in step (1) to statistical analysis; and 

(3) determining whether the likelihood of said long-tenn survival has increased or 
deceased. 

In a different aspect, the invention concerns an array comprising polynucleotides 
hybridizing to a gene set selected from the group consisting of: 

(a) Bcl2, cyclinGl, NFKBp65, NMEl, EFHXl, TOF2B, DR5, TERC. Src, DL\BLO; 

(b) Ki67, XIAP, hENTl, TS, CD9, p27, cyclinGl, pS2, NFBCBp65, CYP3A4; 

(c) GSTMl, XIAP, Ki67,TS, cyclinGl, p27,CYP3A4,pS2,NFKBp65, 
ErbB3; 

(d) PR, NMEl, XLAP, upa, cyclinGl, Contig51037, TERC, EPHXl, 
ALDH1A3, CTSL; 

(e) CA9, NMEl, TERC, cyclinGl, EPHXl, DPYD, Src, TOF2B, NFKBp65, 
VEGFC; 

(f) TFRC, XIAP, Ki67, TS, cyclinGl, p27, CYP3A4, pS2, ErbB3, NFKBp65; 

(g) Bcl2, FRAME, cyclinGl, FOXMl, NFKBp65, TS, XIAP, Ki67, CYF3A4, 
P27; 

22 

SUBSTITUTE SHEET (RULE 26) 



wo 2003/078662 PCT/US2003/007713 

(h) FOXMl, cyclinGl, XIAP, Contig51037, FRAME, TS, Ki67, PDGFRa, 
p27,NFKBp65; 

(i) FRAME, FOXMl, cyclinGl , XIAF, Contig5 1037, TS, Ki6, PDGFRa, p27, 
NFKBp65; 

0) Ki67, XIAP, FRAME, hENTl , contigS 1 037, TS, CD9, p27, ErbB3, cyclinGl ; 
(k) STKl 5, XIAP, FRAME, PLAUR, p27, CTSL, CDl 8, PREP, p53, RFS6KB1 ; 
(1) GSTMl , XIAP, FRAME, p27, ContigS 1037, ErbB3, GSTp, EREG, IDl, 
PLAUR; 

(m) PR, FRAME, NMEl, XIAP, PLAUR, cyclinGl, ContigS 1037, TERC, EPHxi, 
DR5; 

(n) CA9, FOXMl, cyclinGl, XIAP, TS, Ki67, NFKBp65, CyP3A4, GSTM3, 
p27; 

(o) TFRC, XIAP, FRAME, p27, Contig51037, ErbB3, DPYD, TERC, NMEl, 
VEGFC; and 

(p) CEGPl, FRAME, hENTl, XIAP, Contig51037, ErbB3, DPYD, NFKBp65, 

IDl, TS, 
immobilized on a solid sur^ice. 

In an additional aspect, the invention concerns an array comprising polynucleotides 
hybridizing to a gene set selected from the group consisting of: 

(a) FRAME, p27, IGFBP2, HIFl A, TIMF2, ILT2, CYP3 A4, IDl , EstRl , 
DIABLO; 

(b) Contig51037, EPHXI, Ki67, TIMP2, cyclinGl, DPYD, CYP3A4, TP, AIBl, 
CYF2C8; 

(c) BcI2, hENTl, FOXMl, Contig51037, cyclinGl, Contig46653, PTEN, 
CYP3A4, TIMP2, AREG; 

(d) HIFIA, FRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, IDl, EstRl, 
DIABLO; 

(e) IGFIR, FRAME, EPHXI, Contig51037, cyclinGl, Bcl2, NMEl, PTEN, TBP, 
TIMP2; 

if) FOXMl, Contig51037, VEGFC, TBP, HIFIA, DPYD, RAD51C, DCR3, 
cyclinGl, B AG 1; 

(g) EPHXI, Contig51037, Ki67, TIMF2, cyclmGl, DPYD, CYP3A4, TP, AIBl, 
CYF2C8; 

(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl ; 
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(i) CDC25B, ContigS 1037, hENTl , Bcl2, HLAG, TERC, NMEl , upa, IDl . CYP; 
(j) VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl ; 
(k) CTSB, FRAME, p27, IGFBP2, EPHXl , CTSL, BAD, DR5, DCR3, XIAP; 
(1) DIABLO, Ki67, hENTl, TIMP2, IDl, p27, ICRT19, IGFBP2, TS, PDGFB; 
(m) p27, FRAME, IGFBP2, HIFIA. TIMP2, ILT2, CYP3A4, IDl, EstRl, 
DIABLO; 

(n) CDHl ; FRAME, VEGFC; HIFIA; DPYD, TIMP2, CYP3A4, EstRl , RBP4, 
p27; 

(o) IGFBP3, FRAME, p27, Bcl2, XIAP, EstRl , Ki67, TS, Src, VEGF; 

(p) GSTM3, FRAME, p27, IGFBP3, XL\P, FGF2, hENTl , PTEN, EstRl , AFC; 

(q) hENTl, Bcl2, FOXMl, Contig51037, CyclinGl, Contig46653, PTEN, 

CYP3A4, TIMP2, AREG; 
(r) STK15, VEGFC, FRAME, p27, GCLC, hENTl, IDl, TIMP2, EstRl, MCFl ; 
(s) NMEl, PRAM, p27, IGFBP3, XIAP, PTEN, hENTl, Bcl2, CYF3A4, HLAG; 
(t) VDR, Bcl2, p27, hENTl , p53, FDKC2A, EIF4E, TFRC, MCM3, IDl ; 
(u) EIF4E. Contig51037, EPHXl, cyclinGl, Bcl2, DR5, TBF, PTEN, NMEl, 

HER2; 

(V) CCNBl, FRAME, VEGFC, HIFIA, hENTl, GCLC, TIMF2, IDl, p27, upa; 
(w) IDl, FRAME, DL\BLO, hENTl, p27, FDGFRa, NMEl, BINl, BRCAl, TP; 
(x) FBX05, FRAME, IGFBF3,p27,GSTM3, hENTl, XL\P,FGF2,TS, PTEN; 
(y) GUS, HIAl A, VEGFC, GSTM3, DPYD, hENTl , FBX05, CA9, CYP, 
KRT18;and 

(z) Bclx, Bcl2, hENTl, Contig51037, HLAG, CD9, IDl, BRCAl. BINl, 

HBEGF, 
immobilized on a solid sur&ce. 

In all aspects, the polynucleotides can be cDNAs ("cDNA arrays") that are typically about 
500 to 5000 bases long, although shorter or longer cDNAs can also be used and are within the 
scope of tins invention. Alternatively, the polynucleotids can be oligonucleotides (DNA 
microarrays), which are typically about 20 to 80 bases long, although shorter and longer 
oligonucleotides are also suitable and are within the scope of the invention. The solid surface 
can, for example, be glass or nylon, or any other solid surface typically used in preparii^ arrays, 
such as microarrays, and is typically glass. 
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Brief Description of the Drawings 

Figure 1 is a chart illustrating the overall workflow of the process of the invention for 
measurement of gene expression. In the Figure, FPET stands for **fixed paraffin-embedded 
tissue," and "RT-PCR" stands for "reverse transcriptase PGR." RNA concentration is 
determined by using the commercial RiboGreen^" RNA Quantitation Reagent and Protocol. 

Figure 2 is a flow chart showing the steps of an RNA extraction method according to the 
invention alongside a flow chart of a representative commercial method. 

Figure 3 is a scheme illustrating the steps of an improved method for preparing 
fragmented mRNA for expression profiling analysis. 

Figure 4 illustrates methods for amplification of KlSf A prior to RT-PCR. 

Figure 5 illustrates an alternative scheme for repair and amplification of fi-agmented 
mRNA. 

Figure 6 shows the measurement of estrogen receptor mRNA levels in 40 FPE breast 
cancer specimens via RT-PCR. Three 10 micron sections were used for each measurement. 
Each data point represents the average of triplicate measurements. 

Figure 7 shows the results of the measurement of progesterone receptor mRNA levels in 
40 FPE breast cancer specimens via RT-PCR performed as described in the legend of Figure 6 
above. 

Figure 8 shows results fi-om an IVT/RT-PCR experiment 

Figure 9 is a representation of the expression of 92 genes across 70 FPE breast cancer 
specimens. The y-axis shows expression as cycle threshold times. These genes are a subset of 
the genes listed in Table 1. 

Table 1 shows a breast cancer gene list 

Table 2 sets forth amplicon and primer sequences used for amplification of firagmented 
mRNA. ■ 

Table 3 shows the Accession Nos. and SEQ ID NOS of the breast cance genes examined. 

Detailed Description of the Preferred Embodiment 

A. Definitions 

Unless defined otherwise, technical and scientific , terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Singleton et al. Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & 
Sons (New York, NY 1994), and March, Advanced Organic Chemistry Reactions, Mechanisms 
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and Structure 4th ed., John Wiley & Sons (New York, NY 1992), provide one skilled in the art 
with a general guide to many of the terms used in the present application. 

One skilled in the art will recognize many methods and materials similar or equivalent to 
those described herein, which could be used in the practice of the present invention, hideed, the 
present invention is in no way limited to the methods and materials described. For purposes of 
the present invention, the following terms are defmed below. 

The term "microarray" refers to an ordered arrangement of hybridizable array elements, 
preferably polynucleotide probes, on a substrate. 

The term "polynucleotide," when used in singular or plural, generally refers to any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or 
modified RNA or DNA, Thus, for instance, polynucleotides as defined herein include, without 
limitation, single- and double-stranded DNA, DNA including single- and double-stranded 
regions, single- and double-stranded RNA, and RNA including single- and double-stranded 
regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or include single- and double-stranded regions. Li addition, the term 
"polynucleotide" as used herein refers to triple-stranded regions comprising RNA or DNA or 
both RNA and DNA. The strands in such regions may be from the same molecule or from 
different molecules. The regions may include all of one or more of the molecules, but more 
typically involve only a region of some of the molecules. One of the molecules of a triple-helical 
region often is an oligonucleotide. The term "polynucleotide" specifically includes DNAs and 
R^As that contain one or more modified bases. Thus, DNAs or RNAs with backbones modified 
for stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover, 
DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritiated 
bases, are included within the term "polynucleotides" as defined herein. Li general, the term 
"polynucleotide" embraces all chemically, enzymatically and/or metabolically modified forms of 
unmodified polynucleotides, as well as the chemical forms of 1I)NA and RNA characteristic of 
viruses and cells, including simple and complex cells. 

TTie term "oligonucleotide" refers to a relatively short polynucleotide, including, without 
limitation, single-stranded deoxyribonucleotides, single- or double-stranded ribonucleotides, 
RNAiDNA hybrids and double-stranded DNAs. Oligonucleotides, such as single-stranded DNA 
probe oligonucleotides, are often synthesized by chemical methods, for example using automated 
oligonucleotide synthesizers that are commercially available. However, oligonucleotides can be 
made by a variety of other methods, including in vitro recombinant DNA-mediated techniques 
. and by expression of DNAs in cells and organisms. 
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The tenns "differentially expressed gene," "differential gene expression" and their 
synonyms, which are used interchangeably, refer to a gene whose expression is activated to a 
higher or lower level in a subject suffering from a disease, specifically cancer, such as breast 
cancer, relative to its expression in a normal or control subject. The terms also include genes 
5 whose expression is activated to a higher or lower level at different stages of the same disease. It 
is also understood that a differentially expressed gene may be either activated or inhibited at the 
nucleic acid level or protein level, or may be subject to alternative splicing to result in a different 
polypeptide product Such differences may be evidenced by a change in mRNA levels, surface 
expression, secretion or other partitioning of a polypeptide, for example. Differential gene 

10 expression may include a comparison of expression between two or more genes, or a comparison 
of the ratios of the expression between two or more genes, or even a comparison of two 
differently processed products of the same gene, which differ between normal subjects and 
subjects suffering from a disease, . specifically cancer, or between various stages of the same 
disease. Differential expression includes both quantitative, as well as qualitative, differences in 

15 the temporal or cellular expression pattern in a gene or its expression products among, for 
example, normal and diseased cells, or among cells which have undergone different disease 
events or disease stages. For the purpose of this invention, "differential gene expression" is 
considered to be present when there is at least an about two-fold, preferably at least about four- 
fold, more preferably at least about six-fold, most preferably at least about ten-fold difference 

20 between the expression of a given gene in normal and diseased subjects, or in various stages of 
disease development in a diseased subject 

The phrase "gene amplification" refers to a process by which multiple copies of a gene or 
gene fragment are formed in a particular cell or cell line. The duplicated region (a stretch of 
amplified DNA) is often referred to as "amplicon." Usually, the amount of the messenger RNA 
. 25 (mRNA) produced, Le., the level of gene expression, also increases in the proportion of the 
number of copies made of the particular gene expressed^ 

The term "prognosis" is used herein to refer to the prediction of the likelihood of cancer- 
attributable death or progression, including recunrence, metastatic spread, and drug resistance, of 
a neoplastic disease, such as breast cancer. The term "prediction" is used herein to refer to the 

30 likelihood that a patient will respond either favorably or unfavorably to a drug or set of drugs, 
and also the extent of those responses. The predictive methods of the present invention can be 
used clinically to make treatment decisions by choosing the most appropriate treatment 
modalities for any particular patient. The predictive methods of the present invention are 
valuable tools in predicting if a patient is likely to respond favorably to a treatment regimen, such 
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as surgical intervention, chemotherapy with a given drug or drug combination, and/or radiation 
therapy. 

The term "increased resistance" to a particular drug or treatment option, when used in 
accordance with the present invention, means decreased response to a standard dose of the drug 
or to a standard treatment protocol. 

The term "decreased sensitivity" to a particular drug or treatment option, when used in 
accordance with the present invention, means decreased response to a standard dose of the drug 
or to a standard treatment protocol, where decreased response can be compensated for (at least 
partially) by increasing the dose of drug, or the intensity of treatment 

"Patient response" can be assessed using any endpoint indicating a benefit to the patient, 
including, without limitation, (1) inhibition, to some extent, of tumor growth, including slowing 
down and complete growtti arrest; (2) reduction in the nvmiber of tumor cells; (3) reduction in 
tumor size; (4) inhibition (i.e., reduction, slowing down or complete stopping) of tumor cell 
infiltration into adjacent peripheral organs and/or tissues; (5) inhibition (i.e, reduction, slowing 
down or complete stopping) of metastasis; (6) enhancement of anti-tumor immune response, 
which may, but does not have to, result in the regression or rejection of the tumor; (7) relief, to 
some extent, of one or more symptoms associated with the tumor; (8) increasb in the length of 
survival following treatment; and/or (9) decreased mortality at a given point of time following 
treatment. 

The term 'treatment" refers to both therapeutic treatment and prophylactic or preventative 
measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic 
condition or disorder. Those in need of treatment include those already with the disorder as well 
as those prone to have the disorder or those in whom the disorder is to be prevented. In tumor 
(e.g., cancer) treatment, a therapeutic agent may directly decrease the pathology of tumor cells, or 
render the tumor cells more susceptible to treatment by other therapeutic agents, e.g.^ radiation 
and/or chemotherapy. 

The term "tumor," as used herein, refers to all neoplastic cell growth and proliferation, 
whether malignant or benign, and all pre-cancerous and cancerous cells and tissues. 

The terms "cancer" and "cancerous" refer to or describe the physiological condition in 
mammals that is typically characterized by unregulated cell growth. Examples of cancer include 
but are not limited to, breast cancer, colon cancer, limg cancer, prostate cancer, hepatocellular 
cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder 
cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain 
cancer, 
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The "pathology" of cancer includes all phenomena that compromise the well-being of the 
patient. . This includes, without limitation, abnormal or uncontrollable cell growth, metastasis, 
interference with the normal functioning of neighboring cells, release of cytokines or other 
secretory products at abnormal levels, suppression or aggravation of inflanmiatory or 
5 immunological response, neoplasia, premalignancy, malignancy, invasion of surrounding or 
distant tissues or organs, such as lymph nodes, etc. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in 
the art, and generally is an empirical calculation dependent upon probe length, washing 
temperature, and salt concentration. In general, longer probes require higher temperatures for 

10 proper aimealing, while shorter probes need lower temperatures. Hybridization generally 
depends on the ability of denatured DNA to reanneal when complementary strands are present in 
an environment below their melting temperature. The higher the degree of desired homology 
between the probe and hybridizable sequence, the higher the relative temperature which can be 
used. As a result, it follows that higher relative temperatures would tend to make the reaction 

15 conditions more stringent, while lower temperatures less so. For additional details and 
explanation of stringency of hybridization reactions, see Ausubel et al.. Current Protocols in 
Molecular Biology^ Wiley Literscience Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, typically: (1) 
employ low ionic strength and high temperature for washing, for example 0.015 M sodium 

20 chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50^C; (2) employ during 
hybridization a denaturing agent, such as formamide, for example, 50% (v/v) formamide with 
0.1% bovine serum aIbumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50mM sodium phosphate 
buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 
50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 

25 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 
Jig/ml), 0.1% SDS, and 10% dextran sulfate at 42*'C, with washes at 42°C in 0.2 x SSC (sodium 
chloride/sodium citrate) and 50% formamide at 55^*0, followed by a high-stringency wash 
consisting of 0.1 x SSC containing EDTA at 55'*C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al., 

30 Molecular Cloning: A Laboratory Manual , New York: Cold Spring Harbor Press, 1989, and 

include the use of washing solution and hybridization conditions (e.g., temperature, ionic 

strength and %SDS) less stringent that those described above. An example of moderately 

stringent conditions is overnight incubation at 37°C in a solution comprising: 20% formamide, 5 

X SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5 x 

29 

SUBSTITUTE SHEET (RULE 26) 



wo 2003/078662 PCT/US2003/007713 

Denhardt's solution, 10% dextran siilfate, and 20 mg/ml denatured sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize 
how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as 
probe length and the like. 

5 In the context of the present invention, reference to "at least one," "at least two," "at least 

five," etc. of the genes listed in any particular gene set means any one or any and all 
combinations of the genes listed. 

The terms "splicing" and "RNA splicing" are used interchangeably and refer to RNA 
processing that removes introns and joins exons to produce mature miRNA with continuous 
10 coding sequence that moves into the cytoplasm of an eukaiyotic cell. 

In theory, the term "exon" refers to any segment of an interrupted gene that is represented 
in the mature RNA product (B. Lewin. Genes IV Cell Press, Cambridge Mass. 1990). In theory 
the term "intron" refers to any segment of DN A that is transcribed but removed from within the 
transcript by splicing together the exons on either side of it. Operationally, exon sequences occur 
15 in the mRNA sequence of a gene as defined by Ref Seq ID numbers. Operationally, intron 
sequences are the intervening sequences within the genomic DNA of a gene, bracketed by exon 
sequences and having GT and AG splice consensus sequences at their 5' and 3' boundaries. 
B, Detailed Description 

The practice of the present, invention will employ, unless otherwise indicated, 
20 conventional techniques of molecular biology (including recombinant techniques), microbiology, 
cell biology, and biochemistry, which are within the skill of the art. Such techniques are 
explained fully in the literature, such as, "Molecular Cloning: A Laboratory Manual", 2"*^ edition 
(Sambrook et al, 1989); "Oligonucleotide Synthesis" (MJ. Gait, ed., 1984); "Animal Cell 
Culture" (R.L Freshney, ed., 1987); "Methods in Enzymology" (Academic Press, Inc.); 
25 "Handbook of Experimental Immunology", 4* edition (D,M. Weir & C.C. Blackwell, eds-, 
Blackwell Science Inc., 1987); "Gene Transfer Vectors for Mammalian Cells" (J.M. Miller & 
M,P. Calos, eds,, 1987); "Current Protocols in Molecular Biology" (F.M. Ausubel et al-, eds-, 
1987); and "PCR: The Polymerase Chain Reaction", (Mullis et al„ eds., 1994). 
1. Gene Expression Proiilinsi 
30 In general, methods of gene expression profiling can be divided into two large groups: 

methods based on hybridization analysis of polynucleotides, and methods based on sequencing of 
polynucleotides. The most commonly used methods known in the art for the quantification of 
. mRNA expression in a sample include northern blotting and in situ hybridization (Parker & 
Barnes, Methods in Molecular Biology 106:247-283 (1999)); RNAse protection assays (Hod, 
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Biotechniques 13:852-854 (1992)); and reverse transcription polymerase chain reaction (RT- 
PGR) (Weis et aU Trends in Genetics 8:263-264 (1992)). Alternatively, antibodies may be 
employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and 
DNA-RNA hybrid duplexes or DNA-protein duplexes. Representative methods for sequencing- 
5 based gene expression analysis include Serial Analysis of Gene Expression (SAGE), and gene 
expression analysis by massively parallel signature sequencing (MPSS). 
2. Reverse Transcriptase PCR fRT-PCR) 

Of the techniques listed above, the most sensitive and most flexible quantitative method is 
RT-PCR, which can be used to compare mRNA levels in different sample populations, in normal 

10 and tumor tissues, with or without drug treatment, to characterize patterns of gene expression, to 
discriminate between closely related mRNAs, and to analyze RNA structure. 

The first step is the isolation of mRNA jfrom a target sample. The starting material is 
typically total RNA isolated firom human tumors or tumor cell lines, and corresponding normal 
tissues or cell lines, respectively. Thus RNA can be isolated from a variety of primary tumors, 

15 including breast, lung, colon, prostate, brain, liver, kidney, pancreas, spleen, thymus, testis, 
ovary, uterus, etc., tumor, or tumor cell lines, with pooled DNA from healthy donors. If the 
source of mRNA is a primary tumor, mRNA can be extracted, for example, from frozen or 
archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples. 

General methods for mRNA extraction are well known in the art and are disclosed hi 

20 standard textbooks of molecular biology, including Ausubel et aL^ Cxm^ent Protocols of 
Molecular Biology , John Wiley and Sons (1997). Methods for RNA extraction from paraffin 
embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest 56:A67 (1987), 
and De Andres et al, BioTechniques 18:42044 (1995). In particular, RNA isolation can be 
performed using purification kit, buffer set and protease from commercial manufacturers, such as 

25 Qiagen, according to the manufacturer's instructions. For example, total RNA from cells in 
culture can be isolated using Qiagen RNeasy mini-columns. Oflier commercially available RNA 
isolation kits include MasterPure™ Complete DNA and RNA Purification Kit (EPICENTRE®, 
Madison, WI), and Paraffin Block RNA Isolation Kit (Ambion, Inc.)- Total RNA from tissue 
samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from tumor can be 

30 isolated, for example, by cesium chloride density gradient centrifiigation. 

As RNA cannot serve as a template for PCR, the first step in gene expression profiling by 
RT-PCR is the reverse transcription of the RNA template into cDNA, followed by its exponential 
amplification in a PCR reaction. The two most commonly used reverse transcriptases are avilo 
myeloblastosis vims reverse transcriptase (AMV-RT) and Moloney murine leukemia virus 
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reverse transcriptase (MMLV-RT). The reverse transcription step is typically primed using 
specific primers, random hexamers, or oligo-dT primers, depending on the circumstances- and the 
goal of expression profiling. For example, extracted RNA can be reverse-transcribed using a 
. Gene Amp RNA PGR kit (Perkin Elmer, CA, USA), following the manufacturer's instractions. 
5 The derived cDNA can then be used as a template in the subsequent PGR reaction. 

Although the PGR step can use a variety of thermostable DNA-dependent DNA 
polymerases, it typically employs the Taq DNA polymerase, which has a 5'-3' nuclease activity 
but lacks a 3'-5' proofireadmg endonuclease activity. Thus, TaqMan® PGR typically utilizes the 
5 '-nuclease activity of Taq or Ttfa polymerase to hydrolyze a hybridization probe bound to its 

10 target amplicon, but any enzyme with equivalent 5' nuclease activity can be used. Two 
oligonucleotide primers are used to generate an amplicon typical of a PGR reaction. A third 
oligonucleotide, or probe, is designed to detect nucleotide sequence located between the two 
PGR primers. The probe is non-extendible by Taq DNA polymerase enzyme, and is labeled with 
a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission firom the 

15 reporter dye is quenched by the quenching dye when the two dyes are located close together as 
they are on the probe. During the amplification reaction, the Taq DNA polymerase enzyme 
cleaves the probe in a template-dependent manner. The resultant probe fi-agments disassociate in 
solution, and signal fi^om the released reporter dye is fi-ee JBrom the quenching effect of the second 
fluorophore. One molecule of reporter dye is liberated for each new molecule synthesized, and 

20 detection of the unquenched reporter dye provides the basis for quantitative interpretation of the 
data. 

TaqMan® RT-PGR can be performed using commercially available equipment, such as, 
for example, ABI PRISM 7700™ Sequence Detection System'^ (Perkin-Ehner-Applied 
Biosystems, Foster Gity, GA, USA), or Lightcycler (Roche Molecular Biochemicals, Mannheim, 

25 Germany). In a preferred embodiment, the 5' nuclease procedure is run on a real-time 
quantitative PGR device such as the ABI PRISM 7700™ Sequence Detection System™. The 
system consists of a thermocycler, laser, charge-coupled device (GCD), camera and computer. 
The system amplifies samples in a 96-well format on a thermocycler. During amplification, 
laser-induced fluorescent signal is collected in real-time through fiber optics cables for all 96 

30 wells, and detected at the CGD. The system includes software for running the instrument and for 
analyzing the data. 

5 -Nuclease assay data are initially expressed as Gt, or the threshold cycle. As discussed 
above, fluorescence values are recorded during every cycle and represent the amount of product 
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amplified to that point in the amplification reaction. The point when the fluorescent signal is 
first recorded as statistically significant is the threshold cycle (Ct). 

To minimize errors and the effect of sample-to-sample variation, RT-PCR is usually 
performed using an internal standard. The ideal internal standard is expressed at a constant level 
5 among different tissues, and is unaffected by the experimental treatment. RNAs most firequently 
used to normalize patterns of gene expression are mRNAs for the housekeeping genes 
glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and P-actin. 

A more recent variation of the RT-PCR technique is the real time quantitative PGR, 
which measures PGR product accumulation throu^ a dual-labeled fluorigenic probe (i.e., 
10 TaqMan® probe). Real time PGR is compatible both with quantitative competitive PGR, where 
internal competitor for each target sequence is used for normalization, and with quantitative 
comparative PGR using a normalization gene contained within the sample, or a housekeeping 
gene for RT-PGR. For further details see, e.g. Held etal, Genome Research 6:986-994 (1996). 

3. Microarravs 

.15 Differential gene expression can also be identified, or confirmed using the microarray 

technique. Thus, the expression profile of breast cancer-associated genes can be measured in 
either fi^sh or paraffin-embedded tumor tissue, using microarray technology. In this method, 
polynucleotide sequences of interest are plated, or arrayed, on a microchip substrate. The arrayed 
sequences are flien hybridized with specific DNA probes fi-om cells or tissues of interest. Just as 

20 in the RT-PGR method, the source of mRNA typically is total RNA isolated from human tumors 
or tumor cell lines, and corresponding normal tissues or cell lines. Thus RNA can be isolated 
firom a variety of primary tumors or tumor cell lines. If the source of mRNA is a primary tumor, 
mRNA can be extracted, for example, firom fi-ozen or archived paraffin-embedded and fixed (e.g. 
formalin-fixed) tissue samples, which are routinely prepared and preserved in everyday clinical 

25 practice. 

In a specific embodiment of the microarray technique, PGR amplified inserts of cDNA 
clones are applied to a substrate in a dense array. Preferably at least 10,000 nucleotide sequences 
are applied to the substrate. The microarrayed genes, immobilized on the microchip at 10,000 
elements each, are suitable for hybridization under stringent conditions. Fluorescently labeled 
30 cDNA probes may be generated through incorporation of fluorescent nucleotides by reverse 
transcription of RNA extracted from tissues of interest Labeled cDNA probes applied to the 
chip hybridize with specificity to each spot of DNA on the array. Afl:er stringent washing to 
remove non-specifically bound probes, the chip is scanned by confocal laser microscopy or by 
another detection method, such as a GGD camera. Quantitation of hybridization of each arrayed 
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element allows for assessment of corresponding mRNA abimdance. With dual color 
fluorescence, separately labeled cDNA probes generated from two sources of RNA are 
hybridized pauwise to the array. The relative abundance of the transcripts from the two sources 
corresponding to each specified gene is thus determined simultaneously. The miniaturized scale 
of the hybridization affords a convenient and rapid evaluation of the expression pattern for large 
numbers of genes. Such methods have been shown to have the sensitivity required to detect rare 
transcripts, which are expressed at a few copies per cell, and to reproducibly detect at least 
approximately two-fold differences in the expression levels (Schena et al, Proc. Natl Acad. Set 
USA 93(2): 106- 149 (1996)). Microarray analysis can be performed by commercially available 
equipment, following manufacturer's protocols, such as by using the Afifymetrix GenChip 
technology, or Incyte's microarray technology. 

The development of microarray methods for large-scale analysis of gene expression 
makes it possible to search systematically for molecular markers of cancer classification and 
outcome prediction in a variety of tumor types. 

4. Serial Analysis of Gene Expression (SAGE) 

Serial analysis of gene expression (SAGE) is a method that allows the simultaneous and 
quantitative- analysis of a large number of gene transcripts, without the need of providing an 
individxial hybridization probe for each transcript. First, a short sequence tag (about 10-14 bp) is 
generated that contains sufficient information to uniquely identify a transcript, provided that the 
tag is obtained from a unique position within each transcript. Then, many transcripts are linked 
together to form long serial molecules, that can be sequenced, revealing the identity of the 
multiple tags simultaneously. The expression pattern of any population of transcripts can be 
quantitatively evaluated by determining the abundance of individual t£^s, and identifying the 
gene corresponding to each tag. For more details see, e.g. Velculescu et al. Science 270:484-487 
(1 995); and Velculescu et al. Cell 88:243-51 (1997). 

5. Gene Expression Analysis bv Massively Parallel Signature Seauencins (MPSS) 
This method, described by Brenner et al. Nature Biotechnology 18:630-634 (2000), is a 

sequencing approach that combines non-gel-based signature sequencing with in vitro cloning of 
millions of templates on separate 5 jun diameter microbeads. First, a microbead library of DNA 
templates is constructed by in vitro cloning. This is followed by the assembly of a planar array of 
the template-containing microbeads in a flow cell at a high density (typically greater than 3x10^ 
microbeads/cm^). The free ends of the cloned templates on each microbead are analyzed 
simultaneously, using a fluorescence-based signature sequencing method that does not require 
DNA fragment separation. This method has been shown to simultaneously and accurately 
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provide, in a single operation, himdreds of thousands of gene signature sequences from a yeast 
cDNA library. 

6. General Description of the rnRNA Isolation, Purification and Amvlificatinn 
Methods of the Invention 

5 The steps of a representative protocol of the invention, including mRNA isolation, 

purification, primer extension and amplification are illustrated in Figure L As shown in Figure 
1, this representative process starts with cutting about 10 jjxcl thick sections of paraffin-embedded 
tumor tissue samples. The RNA is then extracted, and protein and DNA are removed, following 
the method of tiie invention described below. After analysis of the RNA concentration, RNA 

10 repgdr and/or amplification steps may be included, if necessary, and RNA is reverse transcribed 
using gene specific promoters followed by RT-PCR. Finally, the data are analyzed to identify the 
best treatment option(s) available to the patient on the basis of the characteristic gene expression 
pattern identified in the tumor sample examined. The individual steps of this protocol will be 
discussed in greater detail below. 

15 7. Improved Method for Isolation of Nucleic Acid from Archived Tissue Specimens 

As discussed above, in the first step of the method of the invention, total RNA is extracted 
from the source material of interest, including fixed, parafiSn-embedded tissue specimens, and 
purified sufficiently to act as a substrate in an enzyme assay. Despite the availability of 
commercial products, and the extensive knowledge available concerning the isolation of nucleic 

20 acid, such as RNA, from tissues, isolation of nucleic acid (RNA) from fixed, paraffin-embedded 
tissue specimens (FPET) is not without difficulty. 

In one aspect, the present invention concerns an improved method for the isolation of 
nucleic acid from archived, e.g. FPET tissue specimens. Measured levels of mRNA species are 
useful for defining the physiological or pathological status of cells and tissues. RT-PCR (which 

25 is discussed above) is one of the most sensitive, reproducible and quantitative methods for this 
"gene expression profilmg". Paraffin-embedded, formalin-fixed tissue is the most widely 
available material for such studies. Several laboratories have demonstrated that it is possible to 
successfidly use fixed-parafifin-embedded tissue (FPET) as a source of RNA for RT-PCR (Stanta 
et al, Biotechniques 11:304-308 (1991); Stanta et aU Methods Mol Biol 86:23-26 (1998); 

30 Jackson et ai. Lancet 1:1391 (1989); Jackson et al, J. Clin. Pathol 43:499-504 (1999); Finke et 
al, Biotechniques 14:448-453 (1993); Goldsworthy et aU Mol Carcinog, 25:86-91 (1999); 
Stanta and Bonin, Biotechniques 24:271-276 (1998); Godfirey et al, J. Mol Diagnostics 2:84 

(2000) ; Specht et al, J. Mol Med 78:B27 (2000); Specht et al. Am. J. Pathol 158:419-429 

(2001) )- This allows gene expression profiling to be carried out on the most commonly available 
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source of human biopsy specimens, and therefore potentially to create new valuable diagnostic 
and therapeutic information. 

The most widely used protocols utilize hazardous organic solvents, such as xylene, or 
octane (Finke et al, supra) to dewax the tissue in the paraffin blocks before nucleic acid (RNA 
and/or DNA) extraction. Obligatory organic solvent removal (e.g. with ethanol) and rehydration 
steps follow, which necessitate multiple manipulations, and addition of substantial total time io 
the protocol, which can take up to several days. Commercial kits and protocols for RNA 
extraction from FPET [MasterPure'^" Complete DNA and RNA Purification Kit (EPICENTRE®, 
Madison, WI); Paraffin Block RNA Isolation Kit (Ambion, Inc.) and RNeasy™ Mini kit 
(Qiagen, Chatsworth, CA)] use xylene for deparaffinization, in procedures which typically 
require multiple centrifugations and ethanol buffer changes, and incubations following 
incubation with xylene. 

The present invention provides an improved nucleic acid extraction protocol that produces 
nucleic acid, in particular RNA, sufficiently intact for gene expression measurements. The key 
step in the nucleic acid extraction protocol herein is the performance of dewaxing without the use 
of any organic solvent, thereby eliminating the need for multiple manipulations associated with 
the removal of the organic solvent, and substantially reducing the total time to the protocol. 
According to the invention, wax, e.g. paraffin is removed from wax-embedded tissue samples by 
incubation at 65-75 ""C in a lysis buffer that solubilizes the tissue and hydrolyzes the protein, 
following by cooling to solidify the wax. 

Figure 2 shows a flow chart of an RNA extraction protocol of the present invention in 
comparison with a representative commercial method, using xylene to remove wax. The times 
required for individual steps in the processes and for the overall processes are shown in the chart. 
As shown, the commercial process requires approximately 50% more time than the process of the 
invention. 

The lysis buffer can be any buffer known for cell lysis. It is, however, preferred that 
oligo-dT-based methods of selectively purifying polyadenylated mRNA not be used to isolate 
RNA for the present invention, since the bulk of the mRNA molecules are expected to be 
fragmented and therefore will not have an intact polyadenylated tail, and will not be recovered or 
available for subsequent analytical assays. Otherwise, any niunber of standard nucleic acid 
purification schemes can be used. These include chaotrope and organic solvent extractions, 
extraction using glass beads or filters, salting out and precipitation based methods, or any of the 
purification methods known in the art to recover total RNA or total nucleic acids from a 
biological source. 
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Lysis buffers are commercially available, such as, for example, from Qiagen, Epicentre, or 
Ambion. A preferred group of lysis buffers typically contains urea, and Proteinase K or other 
protease. Proteinase K is very useful in the isolation of high quality, undamaged DNA or RNA, 
since most manmialian DNases and RNases are rapidly inactivated by this en2yme, especially in 
the presence of 0.5 - 1% sodium dodecyl sulfate (SDS). This is particularly important in the 
case of RNA, which is more susceptible to degradation than DNA. While DNases require metal 
ions for activity, and can therefore be easily inactivated by chelating agents, such as EDTA^ there 
is no similar co-factor requirement for RNases, 

Cooling and resultant solidification of the wax permits easy separation of the wax from 
the total nucleic acid, which can be conveniently precipitated, e.g. by isopropanoL Further 
processing depends on the intended purpose. If the proposed method of RNA analysis is subject 
to bias by contaminating DNA in an extract, the RNA extract can be further treated, e.g. by 
DNase, post purification to specifically remove DNA while preserving RNA. For example, if the 
goal is to isolate high quality RNA for subsequent RT-PCR amplification, nucleic acid 
precipitation is followed by the removal of DNA, xisually by DNase treatment. However, DNA 
can be removed at various stages of nucleic acid isolation, by DNase or other techniques well 
known in the art. 

While the advantages of the nucleic acid extraction protocol of the invention are most 
apparent for the isolation of RNA from archived, paraffin embedded tissue samples, the wax 
removal step of the present invention, which does not involve the use of an organic solvent, can 
also be included in any conventional protocol for the extraction of total nucleic acid (RNA and 
DNA) or DNA only. All of these aspects are specifically within the scope of Ihe invention. 

By using heat followed by cooling to remove paraffin, the process of the present invention 
saves valuable processing time^ and eliminates a series of manipulations, thereby potentially 
increasing the yield of nucleic acid. Indeed, experimental evidence presented in the examples 
below, demonstrates that the method of the present invention does not compromise RNA yield. 

8. 5 -multiplexed Gene Specific Primins of Reverse Transcription 

RT-PCR requires reverse transcription of the test RNA population as a first step. The 
most commonly used primer for reverse transcription is oligo-dT, which works well when RNA 
is intact. However, this primer will not be effective when RNA is highly fragmented as is the 
case in FPE tissues. 

The present invention includes the use of gene specific primers, which are roughly 20 
bases in length with a Tm optunum between about 58 and 60 ^^C. These primers will also 
serve as the reverse primers that drive PGR DNA amplification. 
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Another aspect of the invention is the inclusion of multiple gene-specific primers in the 
same reaction mixture. The number of such different primers can vary greatly and can be as low 
as two and as high as 40,000 or more. Table 2 displays examples of reverse primers that can be 
successfully used in carrying out the methods of the invention. Figure 9 show^s expression data 
5 obtained using this multiplexed gene-specific priming strategy. Specifically, Figure 9 is a 
representation of the expression of 92 genes (a subset of genes listed in Table 1) across 70 FPE 
breast cancer specimens. The y-axis shows expression as cycle threshold times. 

An alternative approach is based on the use of random hexamers as primers for cDN A 
synthesis. However, we have experimentally demonstrated that the method of using a 
10 multiplicity of gene-specific primers is superior over the known approach using random 
hexamers. 

9. Preparation of Fragmented mRNA for Expression Profilins^ Assays 
It is of interest to analyze the abundance of specific mRNA species in biological samples, 
since this expression profile provides an index of the physiological state of that sample. mRNA 
15 is notoriously difficult to extract and maintain in its native state, consequently, mRNA recovered 
fi-om biological sources is often fiagmented or somewhat degraded. This is especially true of 
human tissue specimen which have been chemically fixed and stored for extended periods of 
time. 

In one aspect, the present invention provides a means of preparing the mRNA extracted 
20 firom various sources, including archived tissue specimens, for expression profiling in a way that 
its relative abundance is preserved and the mRNA's of interest can be successfially measured. 
This method is useful as a means of preparing mRNA for analysis by any of the known 
expression profiling methods, including RT-PCR coupled with 5' exonuclease of reporter probes 
(TaqMan® type assays), as discussed above, flap endonuclease assays (Cleavase® and Invader® 
25 type assays), oligonucleotide hybridization arrays, cDNA hybridization arrays, oligonucleotide 
ligation assays, 3' single nucleotide extension assays and other assays designed to assess the 
abundance of specific naRNA sequences in a biological sample. 

According to the method of the invention, total RNA is extracted fi"om the source material 
and sufficiently purified to act as a substrate in an enzyme assay. The extraction procedure, 
30 including a new and improved way of removing the wax (e.g. paraffin) used for embedding the 
tissue samples, has been discussed above. It has also been noted that it is preferred that oligo-dT 
based methods of selectively purifying polyadenylated mRNA not be used to isolate RNA for this 
invention since the bulk of the mRNA is expected to be firagmented, will not be polyadenylated 
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and, therefore, will not be recovered and available for subsequent analytical assays if an oligo-dT 
based method is used. 

A diagram of an improved method for repairing fragmented RNA is shown in Figure 3. 
The fragmented RNA purified from the tissue sample is mixed with universal or gene-specific, 
5 single-stranded, DNA templates for each mRNA species of interest. These templates may be full 
length DNA copies of the mRNA derived from cloned gene sources, they may be fragments of 
the gene representing only the segment of the gene to be assayed, they may be a series of long 
oligonucleotides representing either the full length gene or the specific segment(s) of interest. 
The template can represent either a single consensus sequence or be a mixture of polymorphic 

10 variants of the gene. This DNA template, or scaffold, will preferably include one or more dUTP 
or rNTP sites in its length. This will provide a means of removing the template prior to carrying 
out subsequent analytical steps to avoid its acting as a substrate or target in later analysis assays. 
This removal is accomplished by treating the sample with uracil-DNA glycosylase (UDG) and 
heating it to cause strand breaks where UDG has generated abasic sites. In the case of rNTP's, 

15 the sample can be heated in the presence of a basic buffer (pH -10) to induce strand breaks 
where rNTP's are located in the template. 

The single stranded DNA template is mixed with the purified RNA, the mixture is 
denatured and aimealed so that the RNA fragments complementary to the DNA template 
effectively become primers that can be extended along the single stranded DNA templates. DNA 

20 polymerase I requires a primer for extension but will efficiently use either a DNA or an RNA 
primer. Therefore in the presence of DNA polymerase I and dNTP's, the fragmented RNA can 
be extended along the complementary DNA templates. In order to increase the efficiency of the 
extension, this reaction can be thermally cycled, allowing overlapping templates and extension 
products to hybridize and extend until the overall population of fragmented RNA becomes 

25 represented as double stranded DNA extended from RNA fragment primers. 

Following the generation of this "repaired" RNA, the sample should be treated with UDG 
or heat-treated in a mildly based solution to fragment the DNA template (scaffold) and prevent it 
from participating in subsequent analytical reactions. 

The product resulting from this enzyme extension can then be used as a template in a 

30 standard enzyme profiling assay that includes amplification and detectable signal generation such 
as fluorescent, chemiluminescent, colorimetric or other common read outs from enzyme based 
assays. For example, for TaqMan® type assays, this double stranded DNA product is added as 
the template in a standard assay; and, for array hybridization, this product acts as the cDNA 
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template for the cRNA labeling reaction typically used to generate single-stranded, labeled RNA 
for array hybridization. 

This method of preparing template has the advantage of recovering information from 
mRNA fragments too short to effectively act as templates in standard cDNA generation schemes. 
5 In addition, this method acts to preserve the specific locations in mRNA sequences targeted by 
specific analysis assays. For example, TaqMan® assays rely on a single contiguous sequence in 
a cDNA copy of mRNA to act as a PGR amplification template targeted by a labeled reporter 
probe. If mRNA strand breaks occur in this sequence, the assay will not detect that template and 
will imderestimate ttie quantity of that mRNA in the original sample. This target preparation 

1 0 method minimizes that effect from RNA fragmentation. 

The extension product formed in the RNA primer extension assay can be controlled by 
controlling the input quantity of the single stranded DNA template and by doing limited cycling 
of the extension reaction. This is important in preserving the relative abundance of the mRNA 
sequences targeted for analysis. 

15 This method has the added advantage of not requiring parallel preparation for each target 

sequence since it is easily multiplexed. It is also possible to use large pools of random sequence 
long oligonucleotides or fiill libraries of cloned sequences to extend the entire population of 
mRNA sequences in the sample extract for whole expressed genome analysis rather than targeted 
gene specific analysis. 

20 10. Amplification of mRNA Svecies Prior to RT-PCR 

Due to the limited amount and poor quality of mRNA that can be isolated from FPET, a 
new procedure that could accurately amplify mRNAs of interest would be very usefijl, 
particularly for real time quantitation of gene expression (TaqMan®) and especially for 
quantitatively large number (>50) of genes >50 to 10,000. 

25 Current protocols (e.g. Eberwine, Biotechniques 20:584-91 (1996)) are optimized for 

naRNA amplification from small amount of total or poly A^ RNA mainly for microarray analysis. 
The present invention provides a protocol optimized for amplification of small amounts of 
fragmented total RNA (average size about 60-150 bps), utilizing gene-specific sequences as 
primers, as illustrated in Figure 4. 

30 The amplification procedure of the invention uses a very large number, typically as many 

as 100 " 190,000 gene specific primers (GSP's) in one reverse transcription run. Each GSP 
contains an RNA polymerase promoter, e.g. a T7 DNA-dependent RNA polymerase promoter, at 
the 5' end for subsequent RNA amplification. GSP's are preferred as primers because of the 
small size of the RNA. Current protocols utilize dT primers, which would not adequately 
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represent all reverse transcripts of mRNAs due to the small size of the FPET RNA, GSP's can be 
designed by optimizing usual parameters, such as length, Tm, etc. For example, GSP's can be 
designed using the Primer Express® (Applied Biosystems), or Primer 3 (MIT) software program. 
Typically at least 3 sets per gene are designed, and the ones giving the lowest Ct on FPET RNA 
5 (best performers) are selected. 

Second . strand cDNA synthesis is performed by standard procedures (see Figure 4, 
Method 1), or by GSPf primers and Taq pol under PGR conditions (e.g., 95 °C, 10 min (Taq 
activation) then 60 ^C, 45 sec). The advantages of the latter method are that the second gene 
specific primer-^ SGFf adds additional specificity (and potentially more efficient second strand 
10 synthesis) and the option of perfomiing several cycles of PGR, if more starting DNA is necessary 
for RNA amplification by T7 RNA polymerase. RNA amplification is then performed imder 
standard conditions to generate multiple copies of cRNA, which is then used in a standard 
TaqMan® reaction. 

Although this process is illustrated by using T7-based RNA amplification, a person 

15 ' skilled in the art will understand that other RNA polymerase promoters that do not require a 
primer, such as T3 or Sp6 can also be used, and are within the scope of the invention. 

11. A method of Elongation of Fragmented RNA and Subsequent Amplification 
This method, which combines and modifies the inventions described in sections 9 and 10 
above, is illustrated in Figure 5. The procedure begins with elongation of fiagmented mRNA. 

20 This occurs as described above except that the scaffold DNAs are tagged with the T7 RNA 
polymerase promoter sequence at their 5' ends, leading to double-stranded DNA extended from 
RNA firagments. The template sequences need to be removed after in vitro transcription. These 
templates can include dUTP or rNTP nucleotides, enabling enzymatic removal of the templates 
as described in section 9, or the templates can be removed by DNasel treatment. 

25 The template DNA can be a population representing different mRNAs of any number. A 

high sequence complexity source of DNA templates (scaffolds) can be generated by pooling 
RNA firom a variety of cells or tissues. In one embodiment, these RNAs are converted into 
double stranded DNA and cloned into phagemids. Single stranded DNA can then be rescued by 
phagemid growth and single stranded DNA isolation fi-om purified phagemids. 

30 This invention is useful because it increases gene expression profile signals two different 

ways: both by increasing test mRNA polynucleotide sequence length and by in vitro transcription 
amplification. An additional advantage is that it eliminates the need to carry out reverse 
transcription optimization with gene specific primers tagged with the T7 RNA polymerase 
promoter sequence, and thus, is comparatively fast and economical. 
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This invention can be used with a variely of different methods to profile gene expression, 
e.g., RT-PCR or a variety of DNA array methods. Just as in the previous protocol, this approach 
is illustrated by using a T7 pronioter but the invention is not so limited. A person skilled in the 
art will appreciate, however, that other RNA polymerase promoters, such as T3 or Sp6 can also 
.5 be used. 

12. Breast Cancer Gene Set Assayed Gene Subsequences, and Clinical Application o f 
Gene Expression Data 

An important aspect of the present invention is to use the measured expression of certain 
genes by breast cancer tissue to match patients to best drugs or drug combinations, and to provide 

10 prognostic information. For this purpose it is necessary to correct for (normalize away) both 
differences in the amoimt of RNA assayed and variability in the quality of the RNA used. 
Therefore, the assay measures and incorporates the expression of certain normalizing genes, 
including well known housekeeping genes, such as GAPDH and Cypl. Alternatively, 
normalization can be based on the mean or median signal (Ct) of all of the assayed genes or a 

15 large subset thereof (global normalization approach). On a gene-by-gene basis, measured 
normalized amount of a patient tumor mRNA is compared to the amount found in a breast cancer 
) tissue reference set The number (N) of breast cancer tissues in this reference set should be 
sufficiently high to ensure that different reference sets (as a whole) behave essentially the same 
way. If this condition is met, the identity of the individual breast cancer tissues present in a 

20 particular set will have no significant impact on the relative amoxmts of the genes assayed. 
Usually, the breast cancer tissue reference set consists of at least about 30, preferably at least 
about 40 different FPE breast cancer tissue specimens. Unless noted otherwise, normalized 
expression levels for each mRNA/tested tumor/patient will be expressed as a percentage of the 
expression level measured in the reference set. More specifically, the reference set of a 

25 sufficiently high number (e.g. 40) tumors yields a distribution of normalized levels of each 
mRNA species. The level measured in a particular tumor sample to be analyzed falls at some 
percentile within this range, which can be determined by methods well known in the art. Below, 
unless noted otherwise, reference to expression levels of a gene assume normalized expression 
relative to the reference set although this is not always explicitly stated. 

30 The breast cancer gene set is shown in Table L The gene Accession Numbers, and the 

SEQ ID NOs for the forward primer, reverse primer and amplicon sequences that can be used for 
gene amplification,, are listed in Table 2. The basis for inclusion of markers, as well as the 
clinical significance of mRNA level variations with respect to the reference set, is indicated 
below. Genes are grouped into subsets based on the type of clinical significance indicated by 
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their expression levels: A. Prediction of patient response to drugs used in breast cancer treatment, 
or to drugs that are approved for other indications and could be used off-label in the treatment of 
breast cancer. B. Prognostic for survival or recurrence of cancer. 

C. Prediction of Patient Response to Therapeutic Drugs 
5 1. Molecules that specilicallv influence cellular sensitivity to dru^s 

Table 1 lists 74 genes (shown in italics) that specifically influence cellular sensitivity to 
potent drugs, which are also listed. Most of the drugs shown are approved and already used to 
treat breast cancer (e-g.^ anthracyclines; cyclophosphamide; methotrexate; 5-FU and analogues). 
Several of the drugs are used to treat breast cancer off-label or are in clinical development phase 
10 (e.g., bisphosphonates and anti-VEGF mAb). Several of the drugs have not been widely used to 
treat breast cancer but are used in other cancers in which the indicated target is expressed (e.g., 
Celebrex is used to treat familial colon cancer; cisplatin is used to treat ovarian and other 
cancers.) 

Patient response to 5FU is indicated if normalized thynfiidylate synthase rtiRNA amount is 
15 at or below the 15*** percentile, or the sum of expression of thymidylate synthase plus 
dihydropyrimidine phosphoiylase is at or below the 25* percentile, or the sum of expression of 
tliese mRNAs plus thymidine phosphorylase is at or below the 20* percentile. Patients with 
dihydropj^midine dehydrogenase below 5* percentile are at risk of adverse response to 5FU, or 
analogs such as Xeloda. 

20 When levels of thymidylate synthase, and dihydropyrimidine dehydrogenase, are within 

the acceptable range as defined in the preceding paragraph, amplification of c-myc mRNA in the 
upper 15%, against a background of wild-type p53 [as defined below] predicts a beneficial 
response to 5FU (see Arango et al, Cancer Res. 61:4910-4915 (2001)). In the presence of 
normal levels of thymidylate synthase and dihydropyrimidine dehydrogenase, levels of NFkB 

25 and cIAP2 in the upper 10% indicate resistance of breast tumors to the chemotherapeutic drag 
5FU. 

Patient resistance to anthracyclines is indicated if the normalized mRNA level of 
topoisomerase Ila is below the 10* percentile, or if the topoisomerase Hp normalized mRNA 
level is below the 10^^ percentile or if the combined normalized topoisomerase Ila and jj signals 
30 are below the 10* percentile. 

Patient sensitivity to methotrexate is compromised if DHFR levels are more than tenfold 
higher than the average reference set level for this mRNA species, or if reduced folate carrier 
levels are below 10* percentile. 
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Patients whose tumors express CYPIBI in the upper 10%, have reduced likelihood of 
responding to docetaxoL 

The sum of signals for aldehyde dehydrogenase lAl and 1A3, when more than tenfold 
higher than the reference set average, indicates reduced likelihood of response to 
5 • cyclophosphamide. 

Currently, estrogen and progesterone receptor expression as measured by 
immimohistochemistry is used to select patients for anti-estrogen therapy. We have 
demonstrated RT-PCR assays for estrogen and progesterone receptor mRNA levels that predict 
levels of these proteins as determined by a standard clinical diagnostic tests, with high degree of 
10 concordance (Figures 6 and 7). 

Patients whose tumors express ERa or PR mRNA in the upper 70%, are likely to respond 
to tamoxifen or other anti-estrogens (thus, operationally, lower levels of ERa than this are to 
defined ERa-negative). However, when the signal for microsomal epoxide hydrolase is in the 
upper 10% or when mRNAs for pS2/trefoil factor, GATA3 or human chorionic gonadotropin are 
15 at or below average levels found in ERa-negative tumors, anti-estrogen therapy will not be 
beneficial. 

Absence of XIST signal compromises the likelihood of response to taxanes, as does 
elevation of the GST-tt or prolyl endopeptidase [PREP] signal in the upper 10%. Elevation of 
PLAGl in the upper 10% decreases sensitivity to taxanes. 
20 Expression of ERCCl mRNA in the upper 10% indicate significant risk of resistance to 

cisplatin or analogs. 

An RT-PCR assay of Her2 mRNA expression predicts Her2 overexpression as measured 
by a standard diagnostic test, with high degree of concordance (data not shown). Patients whose 
tumors express Her2 (normalized to cyp.l) in the upper 10% have increased likelihood of 

25 beneficial response to treatment with Herceptin or other ErbB2 antagonists. Measurement of 
expression of Grb7 mRNA serves as a test for HER2 gene amplification, because the Grb7 gene 
is closely linked to Her2. When Her2 is expression is high as defined above in this paragraph, 
similarly elevated Grb7 indicates Her2 gene amplification. Overexpression of IGFIR and or 
IGFl or IGF2 decreases likelihood of beneficial response to Herceptin and also to EGFR 

30 . antagonists. 

Patients whose tumors express mutant Ha-Ras, and also express famesyl pyrophosphate 
synthetase or geranyl pyrophosphonate ^thetase mRNAs at levels above the tenth percentile 
comprise a group that is especially likely to exhibit a beneficial response to bis-phosphonate 
drugs. 
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Cox2 is a key control en2yme in thie synthesis of prostaglandins. It is frequently expressed 
at elevated levels in subsets of various types of carcinomas including carcinoma of the breast. 
Expression of this gene is controlled at the transcriptional level, so RT-PCR serves a valid 
indicator of the cellular enzjmie activity. Nonclinical research has shown that cox2 promotes 
tumor angiogenesis, suggesting that this enzyme is a promising drug target in solid tumors- 
Several Cox2 antagonists are marketed products for use in anti-inflammatory conditions. 
Treatment of familial adenomatous polyposis patients \vith the cox2 inhibitor Celebrex 
significantly decreased the number and size of neoplastic polyps. No cox2 inhibitor has yet been 
approved for treatment of breast cancer, but generally this class of drugs is safe and could be 
prescribed off-label in breast cancers in which cox2 is over-expressed. Tumors expressing 
COX2 at levels in the upper ten percentile have increased chance of beneficial response to 
Celebrex or other cyclooxygenase 2 inhibitors. 

The tyrosine kinases ErbBl [EGFR], ErbB3 [Her3] and ErbB4 [Her4]; also the ligands 
TGFalpha, amphiregulin, heparin-binding EGF-like growth factor, and epiregulin; also BRK, a 
non-receptor kinase. Several dmgs in clinical development block the EOF receptor. ErbB2-4, the 
indicated ligands, and BRK also increase the activity of the EGFR pathway. Breast cancer 
patients whose tumors express high levels of EGFR or EGFR and abnormally high levels of the 
other indicated activators of the EGFR pathway are potential candidates for treatment wifli an 
EGFR antagonist 

Patients whose tumors express less than 10% of the average level of EGFR mRNA 
observed in the reference panel are relatively less likely to respond to EGFR antagonists [such as 
Iressa, or ImClone 225]. hi cases in which the EGFR is above this low range, the additional 
presence of epiregulin, TGFa, amphiregulin, or ErbB3, or BRK, CD9, MMP9, or Lotl at levels 
above the 90*^ percentile predisposes to response to EGFR antagonists. Epiregulin gene 
expression, in particular, is a good surrogate marker for EGFR activation, and can be used to not 
only to predict response to EGFR antagonists, but also to monitor response to EGFR antagonists 
[taking fine needle biopsies to provide tumor tissue during treatment]. Levels of CD82 above the 
90* percentile suggest poorer efficacy fi-om EGFR antagonists. 

The tyrosine kinases abl, c-kit, PDGFRalpha, PDGFbeta, and ARG; also, the signal 

transmitting ligands c-kit ligand, PDGFA, B, C and D. The listed tyrosine kinases are all targets 

of the drug Gleevec™ (imatinib mesylate, Novartis), and the listed ligands stimulate one or more 

of the listed tyrosine kinases. In the two indications for which Gleevec™ is approved, tyrosine 

kinase targets (bcr-abl and ckit) are overexpressed and also contain activating mutations. A 

finding that one of the Gleevec™ target tyrosine kinase targets is expressed in breast cancer tissue 
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will prompt a second stage of analysis wherein the gene will be sequenced to determine whether- 
it is mutated. That a mutation found is an activating mutation can be proved by methods known 
in the art, such as, for example, by measuring kinase enzyme activity or by measuring 
phosphorylation status of the particular kinase, relative to the corresponding wild-type kinase. 
Breast cancer patients whose tumors express high levels of mRNAs encoding Gleevec™ target 
tyrosine kinases, specifically, in the upper ten percentile, or mRNAs for Gleevec™ target tyrosine 
kinases in the average range and mRNAs for their cognate growth stimulating ligands in the 
upper ten percentile, are particularly good candidates for treatment v^dth Gleevec™. 

VEGF is a potent and pathologically important angiogenic factor. (See below under 
Prognostic Indicators.) When VEGF mRNA levels are in the upper ten percentile, aggressive 
treatment is warranted. Such levels particularly suggest the value of treatment with anti- 
angiogenic drugs, including VEGF antagonists, such as anti-VEGF antibodies. Additionally, 
KDR or CD3 1 mRNA level in the upper 20 percentile further increases likelihood of benefit from 
VEGF antagonists. 

Faraesyl pyrophosphatase synthetase and geranyl geranyl pyrophosphatase synthetase. 
These enzymes are targets of commercialized bisphosphonate dmgs, which were developed 
originally for treatment of osteoporosis but recently have begun to prescribe them off-label in 
breast cancer. Elevated levels of mRNAs encoding these enzymes in breast cancer tissue, above 
the 90* percentile, suggest use of bisphosphonates as a treatment option. 

2. Multidrug Resistance Factors 

These factors include 10 Genes: gamma glutamyl cysteine synthetase [GCS]; GST-a; 
GST-Tt; MDR-1; MRPl-4; breast cancer resistance protein [BCRP]; lung resistance protein 
[MVP];SXR; YB-1, 

GCS and both GST-a and GST-tc regulate glutathione levels, which decrease cellular 
sensitivity to chemotherapeutic drugs and other toxins by reductive derivatization. Glutathione is 
a necessary cofactor for multi-dmg resistant pumps, MDR-1 and the MRPs. MDRl and MRPs 
function to actively transport out of cells several important chemotherapeutic drugs used in breast 
cancer. 

GSTs, MDR-1, and MRP-1 have all been studied extensively to determine possible have 
prognostic or predictive significance in human cancer. However, a great deal of disagreement 
exists in the literature with respect to these questions. Recently, new members of the MRP 
family have been identified: MRP-2, MRP-3, MRP.4, BCRP, and lung resistance protem [major 
vault protein]. These have substrate specificities that overlap with.those of MDR-1 and MRP-1. 
The incorporation of all of these relevant ABC family members as well as glutathione synthetic . 
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enzymes into the present invention captures the contribution of this family to drug resistance, in a 
way that single or double analyte assays cannot. 

MRP- 1, the gene coding for the multidrug resistance protein. 

P-glycoprotein, is not regulated primarily at the transcriptional level. However, p- 
5 glycoprotein stimulates the transcription of FTP 1 b. An embodiment of the present invention is 
the use of the level of the mKNA for Ihe phosphatase PTPlb as a snrrogate measure of MRP-l/p- 
glycoprotein activity. 

The gene SXR is also an activator of multidmg resistance, as it stimulates transcription of 
certain multidmg resistance factors. 
10 The impact of multidmg resistance factors with respect to chemotherapeutic agents used 

in breast cancer is as follows. Beneficial response to doxombicin is compromised when the 
mRNA levels of either MDRl, GSTa, GSTtc, SXR, BCRP YB-1, or LRP/MVP are in the upper 
four percentile. Beneficial response to methotrexate is inhibited if mRNA levels of any of 
MRPl, MRP2, MRP3, or MRP4 or gamma-glutamyl cysteine synthetase are in the upper four 
15 percentile. 

3. Eukarvotic Translation Initiation Factor 4E rEIF4E] 

EIF4E mRNA levels provides evidence of protein expression and so expands the 
capability of RT-PCR to indicate variation in gene expression. Thus, one claim of the present 
invention is the use of EIF4E as an added indicator of gene expression of certain genes [e.g., 

20 cyclinDl, mdm2, VEGF, and others]. For example, in two tissue specimens containing the same 
amount of normalized VEGF mRNA, it is likely that the tissue containing the higher normalized 
level of EIF4E exhibits the greater level of VEGF gene expression. 

The background is as follows. A key point in the regulation of mRNA translation is 
selection of mRNAs by the EIF4G complex to bind to the 43S ribosomal subunit. The protein 

25 EIF4E [the m7G CAP-binding protein] is often limiting because more mRNAs than EIF4E 
copies exist in cells. Highly structured 5'UTRs or highly GC-rich ones are inefficiently 
translated, and these often code for genes that carry out functions relevant to cancer [e.g., 
cyclinDl, mdm2, and VEGF]. EIF4E is itself regulated at the transcriptional/ mRNA level. 
Thus, expression of EIF4E provides added indication of increased activity of a number of 

30 proteins. 

It is also noteworthy that overexpression of EIF4E transforms cultured cells, and hence is 
an oncogene. Overexpression of EIF4E occurs in several different ^es of carcinomas but is 
particularly significant in breast cancer. EIF4E is typically expressed at very low levels in normal 
breast tissue. 
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D, Prognostic Indicators 

1. DNA Repair Ekzvmes 

Loss of BRCAl or BRCA2 activity via mutation represents the critical oncogenic step in 
the most common type[s] of familial breast cancer. The levels of mRNAs of these important 
5 enzymes are abnormal in subsets of sporadic breast cancer as well. Loss of signals from either 
[to within the lower ten percentile] heightens risk of short survival. 

2. Cell Cycle Remtlators 

Cell cycle regulators include 14 genes: c-MYC; c-Src; Cyclin Dl; Ha-Ras; mdm2; 
pl4ARF; p21 WAFl/CIP; pl6INK4a/pl4; p23; p27; p53; PI3K; PKC-epsilon; PKC-delta. 

10 The gene for p53 [TP53] is mutated in a large fraction of breast cancers. Frequently p53 

levels are elevated v^hen loss of function mutation occurs. When the mutation is dominant- 
negative, it creates survival value for the cancer cell because growth is promoted and apoptosis is 
inhibited. Thousands of different p53 mutations have been found in human cancer, and the 
functional consequences of many of them are not clear. A large body of academic literature 

15 addresses the prognostic and predictive significance of mutated p53 and the results are highly 
conflicting. The present invention provides a functional genomic measure of p53 activity, as 
follows. The activated wild type p53 molecule triggers transcription of the cell cycle inhibitor 
p21 . Thus, the ratio of p53 to p21 should be low when p53 is wild-type and activated. When p53 
is detectable and the ratio of p53 to p21 is elevated in tumors relative to normal breast, it 

20 signifies nonfunctional or dominant negative p53. The cancer literature provides evidence for 
this as bom out by poor prognosis. 

Mdni2 is an important p53' regulator. Activated wildtype p53 stimulates transcription of 
mdm2- The mdm2 protein binds p53 and promotes its proteolytic destruction. Thus, abnormally 
low levels of mdni2 in the presence of normal or higher levels of p53 indicate that p53 is mutated 

25 and inactivated. 

One aspect of the present invention is the use of ratios of mRNAs levels p53:p21 and 
p53:mdm2 to provide a picture of p53 status. Evidence for dominant negative mutation of p53 
(as indicated by high p53:p21 and/or high p53:mdm2 mRNA ratios — specifically in the upper ten 
percentile) presages higher risk of recurrence in breast cancer and therefore weights toward a 
30 decision to ixse chemotherapy in node negative post surgery breast cancer. 

Another important cell cycle regulator is p27, which in the activated form blocks cell 
cycle progression at the level of cdk4. The protein is regulated primarily via 
phosphoiylation/dephosphoiylation, rather than at the transcriptional level. However, levels of 
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p27 mRNAs do vary. Therefore a level of p27 mRNA in the upper ten percentile indicates 
reduced risk of recurrence of breast cancer post surgery. 

Cyclin Dl is a principle positive regulator of entry into S phase of the cell cycle. The gene 
for cyclin Dl is amplified in about 20% of breast cancer patients, and therefore promotes tumor 
5 promotes tumor growth in those cases. One aspect of the present invention is use of cyclin Dl 
mRNA levels for diagnostic purposes in breast cancer. A level of cyclin Dl mRNA in the upper 
ten percentile suggests high risk of recurrence in breast cancer following surgery and suggests 
particular benefit of adjuvant chemotherapy. 

3. Other tumor suppressors and related proteins 

10 These include APC and E-cadherin. It has long been known that the tumor suppressor 

APC is lost in about 50% of colon cancers, with concomitant transcriptional upregulation of E- 
cadherin, an important cell adhesion molecule and growth suppressor. Recently, it has been 
found that the APC gene silenced in 15-40 % of breast cancers. Likewise, the E-cadherin gene is 
silenced [via CpG island methylation] in about 30% of breast cancers. An abnormally low level 

15 of APC and/or E-cadherin mRNA in the lower 5 percentile suggests high risk of recurrence in 
breast cancer following surgery and heightened risk of shortened survival, 

4. Resulators ofApoptosis 

These mclude BCl/BAX family members BC12, Bcl-xl, Bak, Bax and related factors, 
NFk-B and related factors, and also p53BPl/ASPPl and p53BP2/ASPP2. 

20 Bax and Bak are pro-apoptotic and BC12 and Bcl-xl are anti-apoptotic. Therefore, the 

ratios of these factors influence the resistance or sensitivity of a cell to toxic (pro-apoptotic) 
drugs. In breast cancer, unlike other cancers, elevated level of BC12 (in the upper ten percentile) 
correlates with good outcome. This reflects the fact that BC12 has growth inhibitory activity as 
well as anti-apoptotic activity, and in breast cancer the significance of the former activity 

25 outweighs the significance of the latter. The impact of BCI2 is in turn dependent on the status of 
the growth stimulating transcription factor c-MYC. The gene for c-MYC is amplified in about 
20% of breast cancers. When c-MYC message levels are abnormally elevated relative to BC12 
(such that this ratio is in the upper ten percentile), then elevated level of BC12 mRNA is no 
longer a positive indicator. 

30 NFk-B is another important anti-apoptotic factor. Originally, recognized as a pro- 

inflammatory transcription factor, it is now clear that it prevents programmed cell death in 
response to several extracellular toxic factors [such as tumor necrosis factor]. The activity of this 
transcription factor is regulated principally via phosphorylation/dephosphoiylation events. 
However, levels of NFk-B nevertheless do vary firom cell to cell, and elevated levels should 
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correlate with increased resistance to apoptosis. Importantly for present purposes, NFk-B, exerts 
its anti-apoptotic activity largely through its stimulation of transcription of mRNAs encoding 
certain members of the lAP [inhibitor of apoptosis] family of proteins, specifically cIAPl, 
cIAP2, XIAP, and Survivin. Thus, abnormally elevated levels of mRNAs for these lAPs and for 
5 NFk-B any in the upper 5 percentile] signify activation of the NFk-B anti-apoptotic pathway. 
This suggests high risk of recurrence in breast cancer following chemotherapy and therefore poor 
prognosis. One embodiment of the present invention is the inclusion in the gene set of the above 
apoptotic regulators, and the above-outlined use of combinations and ratios of the levels of their 
mRNAs for prognosis in breast cancer. 

10 The proteins p53BPl and 2 bind to p53 and promote transcriptional activation of pro- 

apoptotic genes. The levels of p53BPl and 2 are suppressed in a significant fraction of breast 
cancers, correlating with poor prognosis. When either is expressed in the lower tenth percentile 
poor prognosis is indicated. 

5. Factors that control cell invasion and angio^enesis 

15 These include uPA, PAIl, cathqpsinsB, G and L, scatter factor [HGF], c-met, KDR, 

VEGFj, and CD31. Hie plasminogen activator uPA and its sexpin regulator PAIl promote 
breakdown of extracellular matrices and tumor cell invasion. Abnormally elevated levels of both 
mRNAs in malignant breast tumors (in the upper twenty percentile) signify an increased risk of 
shortened survival, increased recurrence in breast cancer patients post surgery, and increased 

20 importance of receiving adjuvant chemotherapy. On the other hand, node negative patients 
whose tumors do not express elevated levels of these mRNA species are less likely to have 
recurrence of this cancer and could more seriously consider whether the benefits of standard 
chemotherapy justifies the associated toxicity. 

Cathepsins B or L, when expressed in the upper ten percentile, predict poor disease-free 

25 and overall survival. In particular, cathepsin L predicts short survival in node positive patients. 

Scatter factor and its cognate receptor c-met promote cell motility and invasion, cell 
growth, and angiogenesis. In breast cancer elevated levels of mRNAs encoding these factors 
should prompt aggressive treatment with chemotherapeutic drugs, when expression of either, or 
the combination, is above the 90^ percentile. 

30 VEGF is a central positive regulator of angiogenesis, and elevated levels in solid tumors 

predict short survival [note many references showing that elevated level of VEGF predicts short 
survival]. Inhibitors of VEGF therefore slow the growth of solid tumors in animals and humans. 
VEGF activity is controlled at the level of transcription. VEGF mRNA levels in the upper ten 
percentile indicate significantly worse than average prognosis. Other markers of vasciilarization, 
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CD31 [PECAM], and KDR indicate high vessel density in tumors and that the tumor will be 
particularly malignant and aggressive, and hence that an aggressive therapeutic strategy is 
warranted, 

6. Markers for Immune and Inflammatory Cells and Processes 

5 These markers include the genes for Immimoglobulin light chain X, GDI 8, CDS, CD68, 

Fas [CD953, and Fas Ligand. 

Several lines of evidence suggest that the mechanisms of action of certain dmgs used in 
breast cancer entail activation of the host immune/inflammatory response (For example, 
Herceptin®)- One aspect of the present invention is the inclusion in the gene set of markers for 
10 inflammatory and immune cells, and markers that predict tumor resistance to immune 
surveillance. Immunoglobulin light chain lambda is a marker for immunoglobulin producing 
cells. CD 18 is a marker for all white cells. CDS is a marker for T-cells. CD68 is a marker for 
macrophages. 

CD95 and Fas ligand are a receptor: ligand pair that mediate one of two major pathways 
15 by which cytotoxic T cells and NK cells kill targeted cells. Decreased expression of CD95 and 
increased expression of Fas Ligand indicates poor prognosis in breast cancer. Both CD95 and 
Fas Ligand are transmembrane proteins, and need to be membrane anchored, to trigger cell death. 
Certain tumor cells produce a truncated soluble variant of CD95, created as a result of alternative 
splicing of the CD95 mRNA. This blocks NK cell and cytotoxic T cell Fas Ligand-mediated 
20 killing of the tumors cells. Presence of soluble CD95 correlates with poor survival in breast 
cancer. The gene set includes both soluble and full-length variants of CD95. 

7. Cell proliferation markers 

The gene set includes the cell proliferation markers Ki67/MiBl, PCNA, Pinl, and 
thymidine kinase. High levels of expression of proliferation markers associate with high 
25 histologic grade, and short survival. High levels of thymidine kinase in the upper ten percentile 
suggest in creased risk of short survival. Pinl is a prolyl isomereise that stimulates cell growth, in 
part through the transcriptional activation of the cyclin Dl gene, and levels in the upper ten 
percentile contribute to a negative prognostic profile. 

8. Other srowth factors and receptors 

SO This gene set includes IGFl, IGF2, IGFBP3, IGFIR, FGF2, FGFRl, CSF4R/fms, CSF-1, 

IL6 and IL8. All of these proteins are expressed in breast cancer. Most stimulate tumor growth. 
However, expression of the growth factor FGF2 correlates with good outcome. Some have anti- 
apoptotic activity, prominently IGFl. Activation of the IGFl axis via elevated IGFl, IGFIR, or 
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IGFBP3 (as indicated by the sum of these signals in the upper ten percentile) inhibits tumor cell 
, death and strongly contributes to a poor prognostic profile. 

9. Gene expression markers that define subclasses of breast cancer 
These , include: GROl oncogene alpha, Grb7, cytokeratins 5 and 17, retinal binding 
5 protein 4, hepatocyte nuclear factor 3, integrin alpha 7, and lipoprotein lipase. These^ markers 
subset breast cancer into different cell types that are phenotypically different at the level of gene 
expression. Tumors expressing signals for Bcl2, hepatocyte nuclear factor 3, LIVl and ER above 
the mean have the best prognosis for disease free and overall survival following siu-gical removal 
of the cancer. Another category of breast cancer tumor type, characterized by elevated 
10 expression of lipoprotein lipase, retinol binding protein 4, and integrin a?, carry intermediate 
prognosis- Tumors expressing either elevated levels of cytokeratins 5, and 1 7, GRO oncogene at 
levels four-fold or greater above the mean, or ErbB2 and Grb7 at levels ten-fold or more above 
the mean, have worst prognosis. 

Although throughout the present description, including the Examples below, various 
15 aspects of the invention are explained with reference to gene expression studies, the invention 
can be performed in a similar manner, and similar results can be reached by applying proteomics 
techniques that are well known in the art. The proteome is the totality of the proteins present in a 
sample (e.g. tissue, organism, or cell culture) at a certain point of time. Proteomics includes, 
among other things, study of the global changes of protein expression in a sample (also referred 
20 to as "expression proteomics"). Proteomics typically includes the following steps: (1) separation 
of individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification of 
the individual proteins recovered from the gel, e.g. my mass spectrometry and/or N-terminal 
sequencing, and (3) analysis of the data using bioinformatics. Proteomics methods are valuable 
supplements to other methods of gene expression profiling, and can be used, alone or in 
25 combination with other methods of the present invention, to detect the products of the gene 
markers of the present invention. 

Further details of the invention will be described in the following non-limiting Examples. 
Example 1 

Isolation of RNA from formalin-fixed, paraffin-embedded (FPET) tissue specimens 
30 ' A. Protocols 

I. EPICENTRE® Xylene Protocol ' 
RNA Isolation 

(1) Cut 1-6 sections (each 10 pm thick) of paraffin-embedded tissue per sample using 
a clean microtome blade and place into a 1 .5 ml eppendorf tube. 
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(2) To extract paraffin, add 1 ml of xylene and invert the tubes for 10 minutes by 
rocking on a nutator. 

(3) Pellet the sections by centrifugation for 10 minutes at 14,000 x g in an eppendorf 
microcentriftige. 

(4) Remove the xylene, leaving some in the bottom to avoid dislodging the pellet. 

(5) Repeat steps 2-4. 

(6) Add 1 ml of 100% ethanol and^ invert for 3 minutes by rocking on the nutator. 

(7) Pellet the debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf 
microcentrifuge. 

(8) Remove the ethanol, leaving some at the bottom to avoid the pellet. 

(9) Repeat steps 6-8 twice. 

(10) Remove all of the remaining ethanol. 

(1 1) For each sample, add 2 pi of 50 iig/yl Proteinase K to 300 |xl of Tissue and Cell 
Lysis Solution. 

(12) Add 300 |xl of Tissue and Cell Lysis Solution containing the Proteinase K to each 
sample and mix thoroughly. 

(13) Incubate at 65 °C for 90 minutes (vortex mixing every 5 minutes). Visually 
monitor the remaining tissue fragment. If still visible after 30 minutes, add an additional 2 jil of 
50 pg/fil Proteinase K and continue incubating at 65 "^C imtil fragment dissolves. 

(14) Place the samples on ice for 3-5 minutes and proceed with protein removal and 
total nucleic acid precipitation. 

Protein Removal and Precipitation of Total Nucleic Acid 

(1) Add 150 |xl of MPC Protein Precipitation Reagent to each lysed sample and 
vortex vigorously for 10 seconds. 

(2) Pellet the debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf 
microcentrifuge. 

(3) Transfer the supernatant into clean eppendorf tubes and discard the pellet. 

(4) Add 500 \i\ of isopropanol to the recovered supernatant and thoroughly mix by 
rocking on the nutator for 3 minutes. 

(5) Pellet the RNA/DNA by centrifi^ation at 4 ^'C for 10 minutes at 14,000 x g in an 
eppendorf microcentrifuge. 

(6) Remove all of the isopropanol willi a pipet, being careful not to dislodge the 

pellet. 
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Removal of Contaminating DNA from RNA Preparations 

(1) Prepare 200 fxl of DNase I solution for each sample by adding 5 jil of RNase-Free 
DNase I (1 U/jal) to 195 ^il of IX DNase Buffer. 

(2) Completely resuspend the pelleted RNA in 200 p.! of DNase I solution by 
5 vortexing. 

(3) Incubate the samples at 37 ""C for 60 minutes. 

(4) Add 200 III of 2X T and C Lysis Solution to each sample and vortex for 5 
seconds. 

(5) Add 200 III of MPC Protein Precipitation Reagent, mix by vortexing for 10 
. 10 seconds and place on ice for 3-5 minutes. 

(6) Pellet tile debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf 
microcentrifuge. 

(7) Transfer the supernatant containing the RNA to clean eppendorf tubes and discard 
the pellet (Be careful to avoid transferring the pellet) 

15 (8) Add 500 pi of isopropanol to each supernatant and rock samples on the nutator for 

3 minutes. 

(9) Pellet flie RNA by centrifugation at 4 *^C for 10 minutes at 14,000 x g in an 
eppendorf microcentrifuge. 

(10) Remove the isopropanol, leaving some at the bottom to avoid dislodging the 

20 pellet. 

(1 1) Rmse twice with 1 ml of 75% ethanoL Centrifuge briefly if the RNA pellet is 
dislodged, 

(12) Remove ethanol carefiiUy* 

(13) Set under fume hood for about 3 minutes to remove residual efhanoL 
25 (14) Resuspend tiie RNA in 30 yl of TE Buffer and store at -30 °C. 

n. Hot Wax/Urea Protocol of the Invention 
RNA Isolation 

(1) Cut 3 sections (each 10 pih thick) of paraffin-embedded tissue using a clean 
microtome blade and place into a 1.5 ml eppendorf tube. 
30 (2) Add 300 id of lysis buffer (10 mM Tris 7.5, 0.5% sodium lauroyl sarcosine, 0.1 

mM EDTA pH 7.5, 4M Urea) containing 330 ^g/ml Proteinase K (added freshly from a 50 fxg/^1 
stock solution) and vortex briefly. 
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(3) Incubate at 65 °C for 90 minutes (vortex mixing every 5 minutes). Visually 
monitor the tissue fragment. If still visible after 30 minutes, add an additional 2 fxl of 50 |Lig/|al 
Proteinase K and continue incubating at 65 °C until fragment dissolves. 

(4) Centrifuge for 5 minutes at 14,000 x g and transfer upper aqueous phase to new 
5 tube, being careful not to disrupt the paraffin seaL 

(5) Place the samples on ice for 3-5 nciinutes and proceed with protein removal and 
total nucleic acid precipitation. 

Protein Removal and Precipitation of Total Nucleic Acid 

(1) Add 150 p.1 of 7.5M NHUOAc to each lysed sample and vortex vigorously for 10 
10 seconds. 

(2) Pellet the debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf 
microcentrifuge. 

(3) Transfer the supernatant into clean eppendorf tubes and discard the pellet. 

(4) Add 500 of isopropanol to the recovered supernatant and thoroughly mix by 
1 5 rocking on the nutator for 3 minutes. 

(5) Pellet the RNA/DNA by centrifugation at 4 °C for 10 minutes at 14,000 x g in an 
eppendorf microcentrifuge. 

(6) Remove all of the isopropanol with a pipet, being careful not to dislodge the 

pellet. 

20 Removal of Contaminating DNA from RNA Preparations 

(1) Add 45 pi of IX DNase I buffer (1 0 mM Tris-Cl, pH 7,5, 2.5 mM MgCb, 0,1 mM 
CaCh) and 5 iil of RNase-Free DNase I (2U/fxl, Ambion) to each sample. 

(2) Incubate the samples at 37 **C for 60 minutes. 
Inactivate the DNasel by heating at 70 for 5 minutes. 

25 B. Results 

Experimental evidence demonstrates that the hot RNA extraction protocol of the 

invention does not compromise RNA yield. Using 19 FPE breast cancer specimens, extracting 

RNA from three adjacent sections in the same specimens, RNA yields were measured via 

capillary electrophoresis with fluorescence detection (Agilent Bioanalyzer). Average RNA yields 

30 in nanograms and standard deviations with the invented and cormnerdal methods, respectively, 

were: 139+A21 versus 141+A34. 

Also, it was found that the urea-containing lysis buffer of the present invention can be 

substituted for the EPICENTRE® T&C lysis buffer, and the 7.5 M NH4OAC reagent used for 

protein precipitation in accordance with the present invention can be substituted for the 
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EPICENTRE® MPC protein precipitation solution with neither significant compromise of RNA 
yield nor TaqMan® efficiency. 
Example 2 

Amplification of mRNA Species Prior to RT-PCR 

The method described in section 10 above was used with RNA isolated from fixed, 
paraffin-embedded breast cancer tissue. TaqMan® analyses were performed with first strand 
cDNA generated with the T7-GSP primer (unampiified (T7-GSPr)), T7 amplified RNA 
(amplified (T7-GSPr)). RNA was amplified according to step 2 of Figure 4. As a control, 
TaqMan® was also performed with cDNA generated with an uimiodified GSPr (amplified 
(GSPr)). An equivalent amount of initial template (1 ng/well) was used in each TaqMan® 
reaction. 

The results are shown in Figure 8. In vitro transcription increased RT-PCR signal 
intensity by more than 10 fold, and for certain genes by more than 100 fold relative to controls in 
which the RT-PCR primers were the same primers used in method 2 for the generation of 
double-stranded DNA for in vitro transcription (GSP-T7r and GSPf). Also shown in Figure 8 are 
RT-PCR data generated when standard optimized RT-PCR primers (i.e„ lacking T7 tails) were 
used. As shown, compared to this control, the new method yielded substantial increases in RT- 
PCR signal (fi-om 4 to 64 fold in this experiment). 

The new method requures that each T7-GSP sequence be optimized so that the increase in 
the RT-PCR signal is the same for each gene, relative to the standard optimized RT-PCR (with 
non-T7 tailed primers). 

Example 3 

A Study of Gene Expression in Premalignant and Malignant Breast Txmiors ^ 
A gene expression study was designed and conducted with the primary goal to 
molecularly characterize gene expression in paraffin-embedded, fixed tissue samples of invasive 
breast ductal carcinoma, and to explore the correlation between such molecular profiles and 
disease-firee survival. A fiirther objective of the study was to compare the molecular profiles in 
tissue samples of invasive breast cancer with the molecular profiles obtained in ductal carcinoma 
in situ. The study was further designed to obtain data on the molecular profiles in lobular 
carcinoma in situ and in paraffin-embedded, fixed tissue samples of invasive lobular carcinoma. 

Molecular assays were performed on paraffin-embedded, formalin-fixed primary breast 
tumor tissues obtained fi-om 202 individual patients diagnosed with breast cancer. All patients 
underwent surgery with diagnosis of invasive ductal carcinoma of the breast, pure ductal 
carcinoma in situ (DCIS), lobular carcinoma of the breast, or pure lobular carcinoma in situ 
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(LCIS), Patients were included in the study only if histopathologic assessnGtent^ performed as 
described in the Materials and Methods section, indicated adequate amounts of tumor tissue and 
homogeneous pathology. 

The individuals participating in the study were divided into the following groups: 
5 Group 1 : Pure ductal carcinoma in situ (DCIS); n=l 8 

Group 2: Invasive ductal carcinoma n=l 30 

Group 3: Pure lobular carcinoma in situ (LCIS); n=7 

Group 4: Invasive lobular carcinoma n=l 6 

10 Materials and Methods 

Each representative tumor block was characterized by standard histopathology for 
diagnosis, semi-quantitative assessment of amount of tumor, and tumor grade. A total of 6 
sections (10 microns in thickness each) were prepared and placed in two Costar Brand 
Microcentrifuge Tubes (Polypropylene, 1.7 mL tubes, clear; 3 sections in each tube). If the 

15 tumor constituted less than 30% of the total specimen area, the sample may have been crudely 
dissected by the pathologist, using gross microdissection, putting the tumor tissue directly into 
the Costar tube. 

If more than one tumor block was obtained as part of the surgical procedure, all tumor 
blocks were subjected to the same characterization, as described above, and the block most 
20 representative of the pathology was used for analysis. 

Gene Expression Analysis 

mRNA was extracted and purified from fixed, paraffin-embedded tissue samples, and 
prepared for gene expression analysis as described in chapters 7-1 1 above. 

Molecular assays of quantitative gene expression were performed by RT-PCR, using the 
25 ABI PRISM 7900™ Sequence Detection System™ (Perkin-Ehner-Applied Biosystems, Foster 
City, CA, USA). ABI PRISM 7900™ consists of a thennocycler, laser, charge-coupled device 
(CCD), camera and computer. The system amplifies samples in a 384-well format on a 
thermocycler. During amplification, laser-induced fluorescent signal is collected in real-time 
through fiber optics cables for all 384 wells, and detected at the CCD. The system includes 
30 software for running the instrument and for analyzing the data. 

Analysis and Results 

Tumor tissue was analyzed for 185 cancer-related genes and 7 reference genes. The 
threshold cycle (CT) values for each patient were normalized based on the median of all genes 
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for that particular patient. Clinical outcome data were available for all patients from a review of 
registry data and selected patient charts. 
Outcomes were classified as: 

0 died due to breast cancer or to unknown cause or alive with breast cancer 
5 recurrence; 

1 alive without breast cancer recurrence or died due to a cause other than breast 

cancer 

Analysis was performed by: 

1. Analysis of the relationship between normalized gene expression and the binary 
1 0 outcomes of 0 or 1 . 

2. Analysis of the relationship between normalized gene expression and the time to 
outcome (0 or 1 as defined above) where patients who were alive without breast cancer 
recurrence or who died due to a cause other than breast cancer were censored. This approach 
was used to evaluate the prognostic impact of individual genes and also sets of multiple genes. 

15 Analysis of 147 patients with invasive breast carcinoma by binary approach 

In the first (binary) approach, analysis was performed on all 146 patients with invasive 
breast carcinoma, A t test was performed on the group of patients classified as 0 or 1 and the p- 
values for the differences between the groups for each gene were calculated. 
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The following Table 4 lists the 45 genes for which the p-value for the differences between 
the groups was <0,05. 

Table 4 



Gene/ 

SEOIDNO: 


MeanCT 
Alive 


Mean CT 
Deceased 


t-value 


Degrees of 
freedom 


P 






'^9 ^9 


1 09 


1 AA 


U.UUUl 


JrlN./VLVlJD 






1 71 
3, /I 


1 AA 
144 


n nnAi 
U.UUu3 






9Q ^0 


1 ^1 


1 AA 
144 


fv nr\AA 


o 1 Jvl 


-jU.oZ 






1 AA 

144 


A AAA^ 








1 Id 


1AA 

144 


A AAAA 


JN.1-0 1 




90 AO 


1 lA 


144 


A AA1 1 


vJO 1 iVl 1 




11 Ai 


1 n 
-J .2/ 


144 


A AA1 A 


r*AO 


"XA OA 




1 1 Q 


144 


A AA1 O 

0.00 lo 


rj\. 




11 99 
^I.ZZ 


1 1 A 


1 AA 

144 


A AA1 A 




^ 1 


19 1 n 


1 1 n 


144 


A AAT3 




z. / .J> 1 


9A Aft 
ZO.Oo 


1 HA 
3.\J4 


144 


A AAOQ 

U.UUzo 


k3 LJXv V 


:> 1 .Oh- 


in AS 


O 09 

z.yz 


144 


A AAyl 1 






9A OQ 


o oi 


1 AA 

144 


A AA>10 

U.UU4Z 






9C AC 


9 ©7 
Z.o / 


144 


A AA/17 




ZXj* /z 


9A 41 
ZO-*H 


9 70 

z. /y 


1 AA 

144 


A AAAA 




9R ^1 


9R 514. 


9 A7 
-Z.O/ 


1 AA 
144 


A AAQ4 


GSTM3 


98 91 


90 ni 


-.9 A1 


1 Ail 


A AAO^ , 


RPS6KB1 


^ X • 1 o 


10 AT 


9 Al 
Z.Oi 


1 AA 


A AAOO 


Src 


27 97 


97 6Q 


9 ^0 




A ni A^ 


Chkl 






9 ^7 


1 44 


A A1 11 
U.Ul 13 


IDl 


9R 7^ 


90 1 1 
zy, 1 D 


9 ASl 


1 44 


A A141 
U.U141 


EstRl 


24.22 


25.40 


-2 44 


144 


0 0160 


p27 


27.15 


27.51 


-2.41 


144 


0.0174 


CCNBl 


31.63 


30.87 


2.40 


144 


0.0176 


XIAP 


30.27 


30,51 


-2.40 


144 


0.0178 


Chk2 


31.48 


31.11 


2.39 


144 


0.0179 


CDC25B 


29.75 


29.39 


2.37 


144 


0.0193 


IGFIR 


28.85 


29.44 


-2.34 


144 


0.0209 
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AK055699 


33.23 


34.11 


-2.28 


144 


0.0242 


PI3KC2A 


31.07 


31.42 


-2.25 


144 


0.0257 


TGFB3 


28.42 


28,85 


-2.25 


144 


0.0258 


BAGIl 


28.40 


28.75 


-2,24 


144 


0,0269 


CYP3A4 


35-70 


35.32 


2.17 


1 A A 

144 


0.0317 


EpCAM 


28.73 


28.34 


2.16 


144 


0.0321 


VEGFC 


32.28 


31.82 


2.16 


144 


0.0326 


pS2 


28.96 


30,60 


-2.14 


144 


0,0341 


hENTl 


27.19 


26.91 


2.12 


144 


0.0357 


WISPI 


31.20 


31.64 


-2.10 


144 


0.0377 


HNF3A 


27.89 


28.64 


-2.09 


144 


0.0384 


NFKBp65 


33.22 


33.80 


-2.08 


144 


0.0396 


BRCA2 


33.06 


32.62 


2.08 


144 


0.0397 


xivjJrxc 






2.05 


144 


A t\A 1 A 

0.0414 


TKl 


32.27 


31.72 


2.02 


144 


0.0453 


VDR 


30.08 


29.73 


1.99 


144 


0.048S 



In the foregoing Table 4, lower (negative) t-values indicate higher expression (or lower 
CTs), associated with better outcomes, and, inversely, higher (positive) t-values indicate higher 
expression (lower CTs) associated with worse outcomes. Thus, for example, elevated expression 
5 of the FOXMl gene (t-value = 3.92, CT mean alive> CT mean deceased) indicates a reduced 
likelihood of disease free survival. Siimlarly, elevated expression of the CEGPl gene (t-value = - 
3.39; CT mean alive< CT mean deceased) indicates an increased likelihood of disease free 
survival. 

Based on the data set forth in Table 4, the overexpression of any of the following genes in 
10 breast cancer indicates a reduced likelihood of survival without cancer recurrence following 
surgery: FOXMl; PRAME; SKT15, Ki-67; CA9; NMEl; SURV; TFRC; YB-1; RPS6KB1; Src; 
Chkl; CCNBl; Chk2; CDC25B; CYP3A4; EpCAM; VEGFC; hENTl; BRCA2; EGFR; TKl; 
VDR. 

Based on the data set forth in Table 4, the overexpression of any of the following genes in 
15 breast cancer indicates a better prognosis for survival without cancer recurrence following 
surgery: BIcl2; CEGPl; GSTMl; PR; BBC3; GATA3; DPYD; GSTM3; IDl; EstRl; p27; XIAP; 
IGFIR; AK055699; P13KC2A; TGFB3; BAGIl ; pS2; WISPI ; HNF3A; NFKBp65. 
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Analysis of 108 ER positive patient by binary approach 

108 patients with nonnalized CT for estrogen receptor (ER) < 25.2 (i.e., ER positive 
patients) were subjected to separate analysis, A t test was performed on the groups of 
patients classified as 0 or 1 and the p-values for the differences between the groups for 
5 each gene were calculated. The following Table 5 lists the 12 genes where the p-value 

for the differences between the groups was <0.05. 



Table 5 



Gene/ 

SEQIDNO: 


Mean CT 
Alive 


MeanCT 
Deceased 


t-value 


■freedom 


P 


FRAME 


35.54 


33.88 


3.03 


106 


0.0031 


Bcl2 


28.24 


28.87 


-2.70 


106 


0.0082 


FOXMl 


33.82 


32.85 


2.66 


106 


0.089 


DIABLO 


30.33 


30.71 


-2.47 


106 


0.0153 


EPHXl 


28.62 


28.03 


2.44 


106 


0.0163 


HIFIA 


29.37 


28.88 


2.40 


106 


0.0180 


VEGFC 


32.39 


31.69 


2.39 


106 


0.0187 


Ki-67 


30.73 


29.82. 


2.38 


106 


0.0191 


IGFIR 


28.60 


29.18 


-2.37 


106 


0.0194 


VDR 


30.14 


29.60 


2.17 


106 


0.0322 


NMEl 


27J4 


26.80 


2.03 


106 


0.0452 


GSTM3 


28.08 


28.92 


-2.00 


106 


0.0485 



10 

For each gene, a classification algoritiim was utilized to identify the best threshold value 
(CT) for using each gene alone in predicting clinical outcome. 

Based on the data set forth in Table 5, overexpression of the following genes in ER- 

positive cancer is indicative of a reduced likelihood of survival without cancer recurrence 

15 following surgery: FRAME; FOXMl ; EPHXl ; HIFIA; VEGFC; Ki-67; VDR; NMEl . Some of 

these genes (FRAME; FOXMl; VEGFC; Ki-67; VDR; and NMEl) were also identified as 

indicators of poor prognosis in the previous analysis, not limited to ER-positive breast cancer. 

The overexpression of the remaining genes (EPHXl and HIFIA) appears to be negative 

indicator of disease free survival in ER-positive breast cancer only. Based on the data set forth in 

20 Table 5, overexpression of the following genes in ER-positive cancer is indicative of a better 
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prognosis for survival without cancer recurrence following surgery: Bcl-2; DIABLO; IGFIR; 
GSTM3. Of the latter genes, Bcl-2; IGFRl; and GSTM3 have also been identified as' indicators 
of good prognosis in the previous analysis, not limited to ER-positive breast cancer. The 
overexpression of DIABLO appears to be positive indicator of disease free survival in ER- 
5 positive breast cancer only. 

Analysis of multiple genes and indicators of outcome 

Two approaches were taken in order to determine whether using multiple genes would 
provide better discrimination between outcomes. 

First, a discrimination analysis was performed using a forward stepwise approach. 
10 Models were generated that classified outcome wifli greater discrimination than was obtained 
with any single gene alone. 

According to a second approach (time-to-event approach), for each gene a Cox 
Proportional Hazards model (see, e.g. Cox, D. R., and Oakes, D. (1984), Analysis of Survival 
Data^ Chapman and Hall, London, New York) was defined with time to recurrence or death as 
15 the dependent variable, and the expression level of the gene as the independent variable. The 
genes that have a p-value < 0,05 in the Cox model were identified. For each gene, the Cox 
model provides the relative risk (RR) of recurrence or death for a unit change in the expression of 
the gene. One can choose to partition the patients into subgroups at any threshold value of the 
measured expression (on the CT scale), where all patients with expression values above the 
20 threshold have higher risk, and all patients with expression values below the threshold have 
lower risk, or vice versa, depending on whether the gene is an indicator of good (RR>1.01) or 
poor (RR<L01) prognosis. Thus, any threshold value will define subgroups of patients with 
respectively increased or decreased risk. The results are sunmiarized in the following Tables 6 
and?. 

25 
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Table 6 

Cox Model Results for 146 Patients with Invasive Breast Cancer 



Gene 


Relative Risk (RR) 


SE Relative Risk 


p value 


rUAMI 


f\ CO 

O.Do 


0.15 


0.0002 


SrKI5 


0,51 


0.20 


0.0006 


JrRAME 


0.78 


0.07 


0,0007 


Bcl2 


1.66 


0.15 


0.0009 


/-XT"? Tit 

CEGPl 


1.25 


0.07 


0.0014 


GSTMl 


1.40 


0.11 


0.0014 


Ki67 


0.62 


0.15 


0.0016 


FIT* 

rR 


1.23 


0.07 


0.0017 


Contig51037 


0.81 


0.07 


0.0022 


NMEl 


0,64 


0.15 


0.0023 


YB-1 


0.39 


0.32 


0.0033 


TFRC 


0.53 


0.21 


0.0035 


BBC3 


1.72 


0.19 


0.0036 


OAi A3 


1.32 


0,10 


0.0039 


CA9 


0.81 


0.07 


0.0049 


oUKV 


0.69 


0.13 


0.0049 


uriu 


2.58 


0.34 


0.0052 




0.60 


0.18 


0.0055 




1.36 


0.12 


0.0078 


orcz 


0.39 


0.36 


0.0094 




1 1 

l.ol 


A i A 

0.19 


0.0109 




0.54 


0.25 


0.0122 




3.20 


A A ^ 

0.47 


0.0126 




U.Oo 


U-lO 


A A 1 CI 

U.UlDl 


IGFIR 


1.42 


0.15 


0.0153 


Chkl 


0.68 


0.16 


0.0155 


IDI 


1.80 


0.25 


0.0164 


p27 


1.69 


0.22 


0.0168 


Chk2 


0.52 


0.27 


0.0175 
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EstRl 


1.17 


0.07 


0.0196 


HNF3A 


1.21 


0.08 


0.206 


pS2 


1 1 o 


0.05 


0.0230 


BAGIl 


1 oo 


0.29 


0.0266 


AK055699 


1.24 


0.10 


0.0276 


pENTl 


0.51 


0.31 


0.0293 


EpCAM 


0.62 


0.22 


0.0310 


WISPl 


1.39 


0.16 


0.0338 


VEGFC 


0,62 


0.23 


0.0364 


TKl 


0.73 


0.15 


0.0382 


NFKBp65 


1.32 


0.14 


0.0384 




U,oo 


A OA 


U.04U4 


CYP3A4 


0.60 


0.25 


0.0417 


EGFR 


0.72 


0.16 


0.0436 



Table 7 

Cox Model Results for 108 Patients wih ER+ Invasive Breast Cancer 



Gene 


Relative Risk (RR) 


SB Relative Risk 


p-value 


FRAME 


0.75 


0.10 


0.0045 


Contig51037 


0.75 


0.11 


0.0060 


Blc2 


2.11 


0.28 


0.0075 


HIFIA 


0.42 


0.34 


0.0117 


IGFIR 


1.92 


0.26 


0.0117 


FOXMl 


0.54 


0.24 


0.0119 


EPHXl 


0.43 


0.33 


0.0120 


Ki67 


0.60 


0.21 


0.0160 


CDC25B 


0.41 


0.38 


0.0200 


VEGFC 


0.45 


0.37 


0.0288 


CTSB 


0.32 


0.53 


0.0328 


DIABLO 


2.91 


0.50 


0.0328 


p27 


1.83 


0.28 


0.0341 


CDHl 


0.57 


0.27 


0.0352 


IGFBP3 


0.45 


0.40 , 


0.0499 
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The binary and time-to-event analyses, with few exceptions, identified the same genes as 
prognostic markers. For example, comparison of Tables 4 and 6 shows that, with the exception 
of a single gene, the two analyses generated the same list of top 15 markers (as defined by the 
5 smallest p values). Furthermore, when both analyses identified the same gene, they were 
concordant with respect to the direction (positive or negative sign) of the correlation with 
survival/recurrence. Overall, these results strengthen the conclusion that the identified markers 
have significant prognostic value. 

For Cox models comprising more than two genes (multivariate models), stepwise entry 
10 of each individual gene into the model is performed, where the first gene entered is pre-selected 
fi-om among those genes having significant univariate p-values, and the gene selected for entry 
into the model at each subsequent step is the gene that best improves the fit of the model to the 
data. This analysis can be performed with any total number of genes. In the analysis the results 
of which are shown below, stepwise entry was performed for up to 10 genes. 
15 Multivariate analysis is performed using the following equation: 

RR=exp[coef(geneA) x Ct(geneA) + coef(geneB) x Ct(geneB) + coef(geneC) x 
Ct(geneC) + ]. 

In this equation, coefficiencts for genes that are predictors of beneficial outcome are 
positive numbers and coefficients for genes that are predictors of imfavorable outcome are 
20 negative numbers. The "Ct" values in the equation are ACts, i.e. reflect the difference between 
the average normalized Ct value for a population and the normalized Ct measured for the patient 
in question. The convention used in the present analysis has been that ACts below and above the 
population average have positive signs and negative signs, respectively (reflecting greater or 
lesser mRNA abundance). The relative risk (RR) calculated by solving this equation will 
25 indicate if the patient has an enhanced or reduced chance of long-term survival without cancer 
recurrence. 

Multivariate ^ene analysis of 147 patients with invasive breast carcinoma 
(a) A multivariate stepwise analysis, using the Cox Proportional Hazards Model, was 
30 performed on the gene expression data obtained for all 147 patients with invasive breast 
carcinoma. Genes CEGPl, FOXMl, STK15 and FRAME were excluded fi:om this analysis. 
The following ten-gene sets have beeii identified by this analysis as having particularly strong 
predictive value of patient survival without cancer recurrence followmg surgical removal of 
primary tumor. 
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1. Bcl2, cyclinGl, NFKBp65, NMEl, EPHXl, TOP2B, DR5, TERC, Src, 
DIABLO; 

2. Ki67, XIAP, hENTl , TS, CD9, p27, cyclinGl , pS2, NFKBp65, CYP3 A4; 

3. GSTMl , XEAP, Ki67, TS, cyclinGl , p27, CYP3A4, pS2, NFKBp65, 
ErbB3; 

4. PR, NMEl, XIAP, upa, cyclinGl, Contig51037, TERC, EPHXl, 
ALDH1A3, CTSL; 

5. CA9, NMEl , TERC, cyclinGl , EPHXl , DPYD, Src, TOP2B, lNlFKBp65, 
VEGFC; 

6. TFRC, XIAP, Ki67, TS, cyclinGl, p27, CYP3A4, pS2, ErbB3, NFKBp65. 

(b) A multivariate stepwise analysis, using the Cox Proportional Hazards Model, was 
performed on the gene expression data obtained for all 147 patients with invasive breast 
carcinoma, using an interrogation set including a reduced number of genes. The following ten- 
gene sets have been identified by this analysis as having particularly strong predictive value of 
patient survival without canc^ recurrence following surreal removal of primary tumor. 

1. Bcl2, FRAME, cyclinGl, FOXMl,NFKBp65,TS, XIAP, Ki67, CYP3A4, 
p27; 

2. FOXMl, cyclinGl, XIAP, Contig51037, FRAME, TS, Ki67, PDGFRa, 
p27,NFKBp65; 

3. FRAME, FOXMl, cyclinGl, XIAP. Contig51037, TS, Ki6, PDGFRa, p27, 
NFKBp65; 

4. Ki67, XIAP, FRAME, hENTl, contig51037, TS, CD9, p27, ErbB3, cyclinGl; 

5. STK15, XIAP, FRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1; 

6. GSTMl, XL\P, FRAME, p27, Contig51037, ErbB3, GSTp, EREG, IDl, 
PLAUR; 

7. PR, FRAME, NMEl , XL\P, PLAUR, cyclinGl , ContigS 1037, TERC, 
EPHXl, DR5; 

8. CA9, FOXMl, cyclinGl, XL\P, TS, Ki67, NFKBp65, CYP3A4, GSTM3, 
p27; 

9. TFRC, XIAP, FRAME, p27, Contig5 1037, ErbB3, DPYD, TERC, NMEl , 
VEGFC; 

10. CEGPl, FRAME, hENTl, XIAP, Contig51037, ErbB3, DPYD, NFKBp65, 
IDl, TS. 
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Multivariate analysis of patients with ER positive invasive breast carcinoma 
A multivariate stepwise analysis, using the Cox Proportional Hazards Model, was 
performed on the gene expression data obtained for patients with ER positive invasive breast 
carcinoma. The following ten-gene sets have been identified by this analysis as having 
5 particularly strong predictive value of patient survival without cancer recurrence following 
surgical removal of primary tumor. 

1. FRAME, p27,IGFBP2,HIFlA,TIMP2,ILT2,CYP3A4, DDI, EstRl, 
DIABLO; 

2. Contig51037, EPHXl, Ki67, TIMP2, cydinGl, DPYD, CYP3A4, TP, AIBl, 
10 CYP2C8; 

3. Bcl2, hENTl , FOXMl , ContigS 1 037, cycIinGl , Contig46653, PTEN, 
CYP3 A4, TIMP2, AREG; 

4. HEFIA, FRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, IDl, EstRl, 
DIABLO; 

15 5. IGFIR, FRAME, EPHXl, Contig51037,cycUnGl,Bcl2,NMEl, PTEN, TBP. 

TIMP2; 

6. FOXMl, Contig51037, VEGFC. TBP, HIFIA, DPYD, RAD51C, DCR3, 
cyclinGl,BAGl; 

7. EPHXl, Contig51037, Ki67, TIMP2, cyclinGl, DPYD, CYP3A4, TP, AIBl, 
20 CYP2C8; 

8. Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl ; 

9. CDC25B, ContigS 1 037, hENTl , Bcl2, HLAG, TERC, NMEl , upa, IDl , CYP; 

10. VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Finl ; 

1 1 . CTSB, FRAME, p27, IGFBP2, EPHXl, CTSL, BAD, DR5, DCR3, XIAF; 
25 12. DIABLO, Ki67, hENTl, TIMP2, IDl , p27, KRT19, IGFBP2, TS, PDGFB; 

13. p27, FRAME, IGFBP2, HIFIA, TIMF2, ILT2, CYF3A4, IDl, EstRl, 
DIABLO; 

14. CDHl ; FRAME, VEGFC; HIFIA; DPYD, TIMF2, CYF3 A4, EstRl , RBF4, 
p27; 

30 15. IGFBP3, FRAME, p27, Bcl2, XLAF, EstRl , Ki67, TS, Src, VEGF; 

16. GSTM3, FRAME, p27,IGFBF3,XL\P,FGF2, hENTl, PTEN, EstRl, AFC; 

17. hENTl, Bcl2, FOXMl, Contig51037, CycUnGl, Contig46653, PTEN. 
CYP3A4, TIMF2, AREG; 

18. STK15, VEGFC, FRAME, p27, GCLC, hENTl, IDl, TIMP2, EstRl, MCPl; 
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19. NMEU PRAM, p27, IGFBP3, XIAP, PTEN, hENTl, Bcl2, CYP3A4, HLAG; 

20. VDR, Bcl2, p27, hENTl , p53, PI3KC2A, EIF4E, TFRC, MCM3, IDl ; 

21 . EIF4E, Contig51037, EPHXl, cyclinGl, Bcl2, DR5, TBP, PTEN, NMEl, 
HER2; 

22. CCNBl, FRAME, VEGFC, HIFIA, hENTl, GCLC, TIMP2, IDl, p27, upa; 

23. IDl, FRAME, DIABLO, hENTl, p27, PDGFRa, NMEl, BINl, BRCAl, TP; 

24. FBX05, FRAME, IGFBP3, p27, GSTM3, hENTl, XIAP, FGF2, TS, PTEN; 

25. GUS, HIAl A, VEGFC, GSTM3, DPYD, hENTl , FBX05, CA9, CYP, 
KRT18; 

26- Bclx, Bcl2, hENTl, Contig51037, HLAG, CD9, IDl, BRCAl, BINl, 
HBEGF. 

It is noteworthy that many of the foregoing gene sets include genes that alone did not 
have sufficient predictive value to qualify as prognostic markers under the standards discussed 
above, but in combination with other genes, their presence provides valuable information about 
the likelihood of long-term patient survival without cancer recurrence 

All references cited throughout the disclosure are hereby expressly incorporated by 
reference. 

While the present invention has been described with reference to what are considered to 
be the specific embodiments, it is to be understood that the invention is not limited to such 
embodiments. To the contrary, the invention is intended to cover various modifications and 
equivalents included within the spirit and scope of the appended claims. For example, while the 
disclosure focuses on the identification of various breast cancer associated genes and gene sets, 
and on the diagnosis and treatment of breast cancer, similar genes, gene sets and methods 
concerning other types of cancer are specifically within the scope herein. 
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TABLE 2 



Gene 


Accession No. 


Forward Primer 


Reverse Primer 


Amplic 






SEQ ID NO. 


SEQ ID NO. 


SEQ ID 


ABCB1 


NM_000927 


1 


2 


3 


ABCrCI 


Nlvi__004996 


4 


5 


6 


ABCC2 


NM_000392 


7 


8 


' 9 


ABCG3 


NM_0037o6 


10 


A A 
1 1 


12 


ABCC4 


NM_005845 


A 0 

13 


14 


15 


ABL1 


NM_0051 57 


A 0 

16 


17 


18 


ABL2 


KiK^i or\c«i era 

NM__P051 08 


A 0 

19 


20 


21 


AO I D 


NM_001 1Q1 


22 


23 


24 


AKT1 


NIv1_0U51d3 


25 


26 


27 


AKT3 


NM_0054o5 


28 


29 . 


30 


A 1 I~Vl_I -* 


NM_00Od89 


^A 

31 


32 


33 


A 1 rM_l -< AO 


NIvi_U0ud93 


A 

34 


35 


36 


APC 


NM_00003o 


37 


38 


39 


AREG 


NM_001o57 


40 


A A 

41 


42 


B2ivj 


Nlvl_p0404o 


43 . 


A A 

44 


45 


BAK1 


NM_UO I 100 


A 0 

46 


A^ 

47 


A 0 

48 


BAX 


Nlv1_004324 


49 


50 


51 


BCL2 


NM_000o33 


52 


53 


54 


BCL2L1 


iviAif t\r\A A r\A 

NM_00119 1 


55 


56 


57 


BIRCo 


KiKA t\i\A A t^XZ 

NM__0011oo 


58 


59 


60 


BIRC4 


Kin;! r\t\A A ^"7 

Nlvl_001 1 67 


61 


62 


63 


BIRC5 


KIRA f\f\A A r^a 

NM_001168 


64 


65 


66 


BKCA1 


NM_007295 


67 


68 


• 69 


BRCA2 


NM__000Q59 


70 


71 


72 


CCND1 


NM_001758 


73 


74 


75 


CDoZ 


Kill A C\t\C<^'^ A 

Nivi_0 00734 


76 


77 ' 


78 


Cu68 


NM_001251 


79 


80 


81 


CDC25A 


NM_001789 


82 


83 


84 


CDH1 


NM_004360 


85 


86 


87 


CDKN1 A 


nM_0 00389 


88 


89 


90 


CDKN1B 


NM_0 04064 


91 


92 


93 


CDKN2A 


NM_000077 


94 . 


95 


96 


CYP1 B1 


NM_000104 


97 


98 


99 


DHFR 


Nivi_000791 


100 


101 


102 


DPYD 


KIR Jl r\C\f\A A 

NM_000110 


103 


104 


105 


ECGF1 


Nlvi_001953 


106 


107 


108 


EGFR 


NM_005228 


109 


110 


111 


EIF4E 


Nm_001968 


112 


113 


114 


ERBB2 


NM_004448 


115 


116 


117 


ERBB3 


NM_001982 


118 


119 


120 


ESR1 


NM_000125 


121 


122 


123 


ESR2 


NM_001437 


124 


125 


126 


GAPD 


Nm_002046 


127 


128 


129 


GATA3 


Niv1_002051 


130 


131 


132 


GRB7 


NJvi_005310 


133 


134 


135 




MM nni '^ii 

INJVI \J\J \%j I I 


f 00 


10/ 


loo 


GSTP1 


NM 000852 


139 


140 


141 


GUSB 


NM 000181 


142 


143 


144 


hHGF 


M29145 


145 


146 


147 


HNF3A 


NM' 004496 


148 


149 


150 


ID2 


•NM 002166 


151 


152 


153 


IGF1 


NM 000618 


154 


155 


156 



i 
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IGFBPo 


Nlvi_Q 00598 


157 


ITGA7 


NM_002206 


160 


ITGB2 


NM__00021 1 


163 


KDR 


NM_002253 


loo 


KIT 


Nivi__0UU4:22 


•1 cry 

169 




NiVi__uouoyy 


1 /^2 


r\K I 1 / 


NM_;^uUU4z2 


175 


KK I O 


Mil;! AAHylO/l 


178 


- LFL 


NIVl_UOOZo/ 


181 




KIR* nnnoyic 


184 


lvlr\io/ 


InIvi_0UZ4 I / 


• 187 


MVP 


NM_0174oo 


190 


MYC 


Nlvij0024o7 


193 


PUG FA 


KIR if r\/*voc/%"7 

NIvI_002d07 


196 


PUGhD 


NM_002o0o 


199 


PUGrO 


NM__01 O^Uo 


202 




NIVI__UUdzUo 


0 ACT 

205 


rUGrKfcJ 


Nivl^JUUZbUy 


208 


PGf\l 


iNlVi^UUUZyi 


21 1 


PGK 


Nm^OOUyZb 


214 


OIIVIH 

PINT 


NIVl_yUoZZ1 


217 


.PLAU 


NIVl__UUZOOO 


00 A 

220 


PPiH 


NM_u0oo47 


223 


PTtN 


NM_pOOo 14 


226 


PTGS2 


NIv1_000963 


229 


KBP4 


NlVl_^00o744 


232 


KcLA 


Nlvl_02iy7o 


235 


DDI d Q 




238 


DDI Dn 




241 


oUL/Gr-'D 




^ A A 

244 


otzKPiNh: 1 


NM__UUUbU2 


247 




NJv1_00o05d 


. 250 


TDD 
1 BP 


MivI^OOo 1 94 


253 


irrl 


NM_003225 


256 


TrRC 


N1v1__003234 


259 


TK1 


NM_00325o 


262 




INIVI UWU*tO 


ZOO 


TNFSF6 


NM 000639 


268 


TOP2A 


NM 001067 


271 


TOP2B 


NM 001068 


274 


TP53 


NM 000546 


277 


TYMS 


NM 001071 


280 


VEGF 


NM 003376 


283 



158 
161 
164 
167 
170 
173 
176 
179 
182 
185 
188 
191 
194 
197 
200 
203 
206 
.209 
212 
215 
218 
221 
224 
227 
230 
233 
236 
239 
242 
245 
248 
251 
254 
257 
260 
263 
266 
269 
272 
275 
278 
281 
284 



159 

162 

165 

168 

171 

174 

177 

180 

183 

186 

189 

192 

195 

198 

201 

204 

207 

210 

213 

216 

219 

222 

225 

228 

231 

234 

237 

240 

243 

246 

249 

252 

255 

258 

261 

264 

267 

270 

273 

276 

279 

282 

285 
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TABLES 



GENE 


ACCESSION NO. 


SEQ ED NO: 


AK055699 


AK055699 


286 * 


BAG1 


NM 004323 


287 


BBC3 


NM 014417 


288 


Bci2 


NM 000633 


289 


BRCA2 


NM 000059 


290 


CA9 


NM 001216 


291 


CCNBi 


NM 031966 


292 


CDC25B 


NM 021874 


293 


CEGP1 


NM 020974 


294 


Chk1 


NM 001274 


295 


Chk2 


NM 007194 


296 


CYP3A4 


NM 017460 


297 


DIABLO 


NM 019887 


298 


DPYD 


NM 000110 


299 


EGFR 


NM 005228 


300 


EpCAM 


NM 002354 ^ 


301 


EPHX1 


NM 000120 


302 


EstR1 


NM 000125 


303 


FOXM1 


NM 021953 


• 304 • 


GATA3 


NM 002051 


305 


GSTM1 


NM 000561 


306 


GSTM3 


NM 000849 


307 


hENT1 


NM 004955 


308 


HIF1A 


NM 001530 


309 


HNF3A 


NM 004496 


310 


ID1 


NM 002165 


311 


IGF1R 


NM 000875 


312 * 




NM. 00241 7 


313 


NFKBp65 


NM 021975 


314 


NME1 


NM 000269 


315 


p27 


NM 004064 


316 


PI3KC2A 


NM 002645 


317 


PR 


NM 000926 


318 


PRAME 


NM 006115 


319 


pS2 


NM 003225 


320 


RPS6KB1 


NM 003161 


321 


Src 


NM 004383 


322 


STK15 


NM 003600 


323 


SURV 


NM 001168 


324 


TFRC 


NM 003234 


325 


TGFB3 


NM 003239 


326 


TK1 


NM 003258 


327 


VDR 


NM 00037B 


328 


VEGFC 


NM 005429 


329 


W1SP1 


NM 003882 


330 


X!AP 


NM 001167 


331 


YB-1 


NM 004559 


332 


ITGA7 


NM 002206 


333 


PDGFB 


NM 002608 


334 


Upa 


NM 002658 


335 " 


TBP 


NM 003194 


336 
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TABLE 3 rContinuedY 



PDGFRa 


NM 006206 


337 


Pin1 


NM 006221 


338 


CYP 


NM 006347 


339 


RBP4 


NM 006744 


340 


BRCA1 


NM 007295 


341 


APC 


NM 000038 


342 


GUS 


NM 000181 


'343 


CD18 


NM 000211 


344 


PTEN 


NM 000314 


345 


P53 


NM 000546 


346 


ALDH1A3 


NM 000693 


347 


GSTp 


NM 000852 


348 


TOP2B 


NM 001068 


349 


TS 


NM 001071 


350 


Bclx 


NM 001191 


351 


AREG 


NM 001657 


352 


TP 


NM 001953 


353 


EIF4E 


NM 001968- 


354 ' -,, \ 


ErbB3 . 


NM 001982 


355 


EREG 


NM 001432 


356 


GCLC 


NM 001498 


357 


CD9 


NM 001769 


358 


HB-EGF 


NM 001945 


359 


1GFBP2 


NM 000597 


360 


CTSL 


NM 001912 


361 


PREP 


NM 002726 


362 


CYP3A4 


NM 017460 


363 


ILT-2 


NM 006669 


364 


MCM3 


NM 002388 


365 


KRT19 


NM 002276 


366 


KRT18 


NM 000224 


367 


TIMP2 


NM 003255 


368 


BAD 


NM 004322 


369 


CYP2C8 


NM 030878 


370 


DCR3 


NM 016434 


371 


PLAUR 


NM 002659 


372 


PI3KC2A 


NM 002645 


373 


FGF2 


NM 002006 


374 


HLA-G 


NM 002127 


375 


AIB1 


NM 006534 


376 


MCP1 


NM 002982 


377 


Contig46653 


Contig46653 


378 


RhoC 


NM 005167 


379 


DR5 


NM 003842 


380 


RAD51C 


NM 058216 


381 


BIN1 


NM 004306 


382 


VDR 


NM 000376 


383 


TERC 


U86046 


384 
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WHAT IS CLAIMED IS : 

L A method for predicting clinical outcome for a patient diagnosed with cancer, 
comprising 

determining the expression level of one or more genes, or their expression products, 
selected from the group consisting of p53BP2, cathepsin B, cathepsin L, Ki67/MiBl, and 
thymidine kinase in a cancer tissue obtained from the patient, normalized against a control gene 
or genes, and compared to the amount found in a reference cancer tissue set, 

wherein a poor outcome is predicted if: 

(a) the expression level of p53BP2 is in the lower 10*^ percentile; or 

(b) the expression level of either cathepsin B or cathepsin L is in the upper 10**' 
percentile; or 

(c) the expression level of any either Ki67/MiBl or thymidine kinase is in the upper 
lO**' percentile. 

2. The method of claim 1 wherein poor clinical outcome is measured in terms of 
shortened survival or increased risk of cancer recurrence. 

3. The method of claim 2 wherein poor clinical outcome is measured in terms of 
shortened survival or increased risk of cancer recurrence following surgical removal of the 
cancer. 

4. The method of claim 1 wherein the cancer is selected from the group consisting of 
breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer, 
pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the 
urinaiy tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer. 

5. The method of claim 4 wherein the cancer is breast cancer. 

6. The method of claim 5 wherein the expression level of p53BP2 is determined. 

7. The method of claim 5 wherein the expression levels of cathepsin B and cathepsin 
L are determined. 

8. The method of claim 5 wherein the expression level of cathepsin L is determined. 
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9. The method of claim 5 wherein the expression levels of Ki67/MiBl and 
thymidine kinase are determined. 

5 10. The method of claim 5 wherein the expression level of Ki67/MiBl is determined. 

11. The method of claim 5 wherein the expression level of thymidine kinase is 
determined. 

10 12. The method of claim 1 wherein the expression level of more than one gene, or 

gene product, is determined. 

13. The method of claim 1 wherein the expression level of more than two genes is 
determined. 

15 

14. The method of claim 13 fiirther comprising the step of subjecting the expression 
data to multivariate analysis using the Cox Proportional Hazards model. 

15. The method of claim 1 wherein the expression level is determined using RNA 
20 obtained from a formalin-fixed, paraffin-embedded tissue sample. 

16. The method of claim 1 wherein the expression level is determined by reverse 
phase polymerase chain reaction (RT-PCR). 

25 17. The method of claim 1 6 wherein said RNA is fragmented. 

18. A method of predicting the likelihood of the recurrence of cancer following 
treatment in a cancer patient, comprising determining the expression level of p27, or its 
expression product, in a cancer tissue obtained from said patient, normalized against a control 

30 gene or genes, and compared to the amoimt foimd in a reference cancer tissue set, wherein an 
expression level in the upper 10th percentile indicates decreased risk of recurrence following 
treatment. 

19. The method of claim 18 wherein the cancer is selected from the group consisting 
35 of breast cancer, colon cancer, Itmg cancer, prostate cancer, hepatocellular cancer, gastric cancer, 
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pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the 
urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer. 



20. The method of claim 19 wherein the cancer is breast cancer. 

5 

21. The method of claim 20 wherein the expression level is determined following 
surgical removal of cancer. 

22. The method of claun 20 wherein the expression level is determined using RNA 
1 0 obtained from a formalin-fixed, parafSn-embedded tissue sample. 

23. The method of claim 22 wherein said RNA is fragmented. 

24. The method of claim 22 wherein the expression level is determined by reverse 
1 5 phase polymerase chain reaction (RT-PCR). 

25. A method for classifying cancer comprising, determining the expression level of 
two or more genes selected from the group consisting of Bcl2, hepatocyte nuclear factor 3, ER, 
ErbB2 and Grb7, or their expression products, in a cancer tissue, normalized against a control 

20 gene or genes, and compared to the amount found in a reference cancer tissue set, wherein (i) 
tumors expressing at least one of Bcl2j hepatocyte nuclear factor 3, and ER, or their expression 
products, above Ihe mean expression level in the reference tissue set are classified as having a 
good prognosis for disease free and overall patient survival following treatment; and (ii) tumors 
expressing elevated levels of ErbB2 and Grb7, or their expression products, at levels ten-fold or 

25 more above the mean expression level in the reference tissue set are cleissified as having poor 
prognosis of disease free and overall patient survival following treatment. 

'J 

26. The method of claim 26 wherein the cancer is selected from the group consisting 
of bre^t cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer, 

30 pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the 
urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer. 

27. The method of claim 26 wherein the cancer is breast cancer. 
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28. The method of claim 26 wherein the expression level is determined following 
surgical removal of cancer. 

29. The method of claim 26 wherem the expression level is determined using RNA 
5 obtained from a formalin-jBxed, paraffin-embedded tissue sample. 

30. The method of claim 29 wherein said RNA is fragmented. 

31. The method of claim 29 wherein the expression level is determined by reverse 
1 0 phase polymerase chain reaction (RT-PCR). 

32. A method of predicting the likelihood of long-term survival of a breast cancer 
patient without the recurrence of breast cancer, following surgical removal of the primary tumor, 
comprising determining the expression level of one or more prognostic RNA transcripts or their 

15 product in a breast cancer tissue sample obtained from said patient, normalized against the 
expression level of all RNA transcripts or their products in said breast cancer tissue sample, or of 
a reference set of RNA transcripts or their products, wherein the prognostic transcript is the 
transcript of one or more genes selected from the group consisting of: FOXMl, FRAME, Bcl2, 
STK15, CEGPl, Ki-67, GSTMl, CA9, PR, BBC3, NMEl, SURV, GATA3, TFRC, YB-1, 

20 DPYD, GSTM3, RPS6KB1, Src, Chkl, IDl, EstRl, p27, CCNBl, XIAP, Chk2, CDC25B, 
IGFIR, AK055699, P13KC2A, TGFB3, BAGIl, CYP3A4, EpCAM, VEGFC, pS2, hENTl, 
WlSPl , HNF3A, NFKBp65, BRCA2, EGFR, TKl, VDR, Contig51037, pENTl, EPHXl, IFIA, 
DIABLO, CDHl, HIFla, IGFBP3, CTSB, and Her2, wherein overexpression of one or more of 
FOXMl, FRAME, STK15, Ki-67, CA9, NMEl, SURV, TFRC, YB-1, RPS6KB1, Src, Chkl, 

25 CCNBl, Chk2, CDC25B, CYP3A4, EpCAM, VEGFC, hENTl, BRCA2, EGFR, TKl, VDR, 
EPHXl, IFIA, Contig51037, CDHl, HIFla, IGFBP3, CTSB, Her2, and pENTl indicates a 
decreased likelihood of long-term survival without breast cancer recurrence, and the 
overexpression of one or more of Bcl2, CEGPl, GSTMl, PR, BBC3, GATA3, DPYD, GSTM3, 
IDl, EstRl, p27, XIAP, IGFIR, AK055699, P13KC2A, TGFB3, BAGIl, pS2, WISPl, HNF3A, 

30 NFKBp65, and DIABLO indicates an increased likelihood of long-term survival without breast 
cancer recurrence. 

33. The miethod of claim 32 comprising determining the expression level of at least 
two of said prognostic transcipts or their expression products. . 
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34. The method of claim 32 wherein the breast cancer is invasive breast carcinoma, 
comprising determination of the expression levels of the transcripts of the following genes, or 
their expression products: FOXMl, FRAME, Bcl2, STK15, CEGPl, Ki-67, GSTMl, PR, 

5 BBC3, NMEl, SURV, GATA3, TFRQ YB-1, DPYD, CA9, Contig51037, RPS6K1 and Her2. 

35. - The method of claim 32 wherein said breast cancer is characterized by 
overexpression of the estrogen receptor (ER). 

10 36, The method of claim 35 comprising determiriSiotfof "the expression levels of the 

transcripts of at least two of the following genes, of their expression products: FRAME, Bcl2, 
FOXMl, DIABLO, EPHXl, fflFlA, VEGFC, Ki-67, IGFIR, VDR, NMEl, GSTM3, 
Contig51037, CDC25B, CTSB, p27, CDHl, and IGFBF3. 

15 37. The method of claim 32 wherein the expression level of one or more prognostic 

RNA transcripts is determined. 

38. The method of claim 37 wherein said RNA is isolated jfrom a fixed, wax- 
embedded breast cancer tissue specimen of said patient. 

20 

39. An array comprising polynucleotides hybridizing to the following genes: 
FOXMl, FRAME, 'Bc12, STK15, CEGFl, Ki-67, GSTMl, PR, BBC3, NMEl, SURV, GATA3, 
TFRC, YB-1, DPYD, CA9, Contig51037, RPS6K1 and Her2, immobilized on a solid surface. 

25 40. The array of claim 39 comprising poljmucleotides hybridizing to the following 

genes: FOXMl, FRAME, Bcl2, STK15, CEGFl, Ki-67, GSTMl, CA9, PR, BBC3, NMEl, 
SURV, GATA3, TFRC, YB-1, DPYD, GSTM3, RFS6KB1, Src, Chkl, IDl, EstRl, p27, 
CCNBl, XIAF, Chk2, CDC25B, IGFIR, AK055699, P13KC2A, TGFB3, BAGIl, CYF3A4, 
EpCAM, VEGFC, pS2, hENTl, WISPl, HNF3A, NFKBp65, BRCA2, EGFR, TKl, VDR, 

30 Contig51037, pENTl, EPHXl, IFIA, CDHl, HIFla, IGFBP3, CTSB, Her2 and DIABLO, 
immobilized on a solid surface. 
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41- A method of predicting the likelihood of long-term survival of a patient 
diagnosed with invasive breast cancer, without the recurrence of breast cancer, following surgical 
removal of the primary tumor, comprising the steps of: 

(1) detemiining the expression levels of the RNA transcripts or the expression 
products of genes of a gene set selected from the group consisting of 

(a) Bcl2, cyclinGl, NFKBp65, NMEl, EPHXl, TOP2B, DR5, TERC, Src, DIABLO; 

(b) Ki67, XIAP, hENTl, TS, CD9, p27, cyclinGl, pS2, NFKBp65, CYP3A4; 

(c) GSTMl, XIAP, Ki67, TS, cycHnGl, p27, CYP3A4, pS2, NFKBp65, 
ErbB3; 

(d) PR, NMEl, XIAP, upa, cyclinGl, Contig51037, TERC, EPHXl, 
ALDH1A3, CTSL; 

(e) CA9, NMEl, TERC, cyclinGl, EPHXl, DPYD, Src, TOP2B, NFKBp65, 
VEGFC; 

(f) TFRC, XIAP, Ki67, TS, cyclinGl, p27, CYP3A4, pS2, ErbB3, NFKBp65; 

(g) Bcl2,PRAME, cyclinGl, FOXMl,NFKBp65,TS, XIAP, Ki67, CYP3A4, 
p27; 

(h) FOXMl, cyclinGl, XIAP, Contig51037, PRAME, TS, Ki67, PDGFRa, 
p27,NFKBp65; 

(i) PRAME, FOXMl, cyclinGl, XIAP, Contig51037, TS, Ki6, PDGFRa, p27, 
NFKBp65; 

(i) Ki67, XIAP, PRAME, hENTl , contigS 1 037, TS, CD9, p27, ErbB3, cyclinGl ; 
(k) STK15, XIAP, PRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1 ; 
a) GSTMl, XIAP, PRAME, p27, Contig51037, ErbB3, GSTp, EREG, IDl, 
PLAUR; 

(m) PR, PRAME, NMEl, XL\P, PLAUR, cyclinGl, Contig51037, TERC, 
EPHXl, DR5; 

(n) CA9, FOXMl, cyclinGl, XIAP, TS, Ki67, NFKBp65, CYP3A4, GSTM3, 
p27; 

(o) TFRC, XL\P, PRAME, p27, ContigS 1 037, ErbB3, DPYD, TERC, NMEl , 
VEGFC; and 

(p) CEGPl, PRAME, hENTl, XL^P, Contig51037, ErbB3, DPYD, NFKBp65, 
IDl.TS 
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in a breast cancer tissue sample obtained firom said patient, normalized against the 
expression levels of all RNA transcripts or their products in said breast cancer tissue sample, or 
of a reference set of KNA transcripts or their products; 

(2) subjecting the data obtained in step (a) to statistical analysis; and 

(3) determining whether the likelihood of said long-terai survival has 
increased or decreased. 

42. A method of predicting the likelihood of long-term survival of a patient 
diagnosed with estrogen receptor (ER)-positive invasive breast cancer, without the recurrence of 
breast cancer, following surgical removal of the primary tumor, comprising the steps of: 

(1) determining the expression levels of the RNA transcripts or the expression 
products of genes of a gene set selected from the group consisting of 

(a) FRAME, p27, IGFBP2, HIFIA. TIMP2, ILT2. CYP3A4, IDl, EstRl, 
DIABLO; 

(b) Contig51037, EPHXl, Ki67, TIMP2, cyclinGl, DPYD, CYP3A4, TP, AIBl, 
CYP2C8; 

(c) Bcl2, hENTl, FOXMl, Contig51037, cyclinGl, Contig46653, PTEN, 
CYP3A4, TIMP2, AREG; 

(d) HIFIA, PRAME,p27,IGFBP2,TIMP2,ILT2,CYP3A4, IDl, EstRl, 
DIABLO; 

(e) IGFIR, FRAME, EPHXl, Contig51037, cyclinGl, Bcl2, NMEl, PTEN. TBP, 
TIME2; 

(f) FOXMl, Contig51037, VEGFC, TBP, HIFIA, DPYD, RAD51C, DCR3, 
cyclinGl, BAGl; 

(g) EPHXl , ContigS 1037, Ki67, TIMP2, cyclinGl , DPYD, CYP3 A4, TP, AIBl , 
CYP2C8; 

(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl; 

(i) CDC25B, Contig51037, hENTl, Bcl2, HLAG, TERC, NMEl, upa, IDl, CYP; 
0') VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl ; 

(k) CTSB, FRAME, p27, IGFBP2, EFHXl, CTSL, BAD, DR5, DCR3, XIAP; 
(I) DIABLO, Ki67, hENTl , TIMP2, IDl, p27, KRTl 9, IGFBP2, TS, PDGFB; 
(m) p27, FRAME, IGFBP2, HIFIA, TIMP2,ILT2,CYP3A4, IDl, EstRl, 
DIABLO; 
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(n) CDHl ; FRAME, VEGFC; HIFl A; DPYD, TIMP2, CYP3 A4, EstRl , RBP4, 
p27; 

(o) IGFBP3, FRAME, p27, Bcl2, XIAF, EstRl , Ki67, TS, Src, VEGF; 

(p) GSTM3, FRAME, p27, IGFBP3, XIAP, FGF2, hENTl, PTEN, EstRl , APC; 

(q) hENTl, Bcl2, FOXMl, Contig51037, CyclinGl, Contig46653, PTEN, 

CYP3A4, TIMP2, AREG; 
(r) STK15, VEGFC, FRAME, p27, GCLC, hENTl, IDl, TIMP2, EstRl, MCPl; 
(s) NMEl, PRAM, p27, IGFBP3, XIAP, PTEN, hENTl, Bcl2, CYP3A4, HLAG; 
(t) VDR, Bcl2, p27, hENTl , p53, PI3KC2A, EIF4E, TFRC, MCM3, IDl ; 
(u) EIF4E, Contig51037, EPHXl, cyclinGl, Bcl2, DR5, TBP, PTEN, NMEl, 
HER2; 

(v) CCNBl, FRAME, VEGFC, HIFIA, hENTl, GCLC, TIMF2, IDl, p27, upa; 
(w) IDl, FRAME, DIABLO, hENTl, p27, FDGFRa, NMEl, BESfl, BRCAl, TP; 
(x). FBX05, FRAME, IGFBP3, p27, GSTM3, hENTl , XIAP, FGF2, TS, PTEN; 
(y) GUS, HL\1 A, VEGFC, GSTM3, DPYD, hENTl , EBX05, CA9, CYP, 
KRT18;and 

(z) BclXi Bcl2, hENTl, Contig51037, HLAG, CD9, IDl, BRCAl, BINl, 
HBEGF; 

(2) subjectmg the data obtained in step (1 ) to statistical analysis; and 

(3) determinmg whether the likelihood of said long-tenn survival has 
increased or decreased. 

43 . The method of claim 41 or claim 42 wherein said statistical analysis is performed 

by vising the Cox Proportional Hazards model. 

I 

44. An array comprising polynucleotides hybridizing to a gene set selected from the 
group consisting of 

(a) Bcl2, cyclinGl , NFKBp65, NMEl, EPHXl , TOP2B, DR5, TERC, Src, DIABLO; 

(b) Ki67, XIAP, hENTl, TS, CD9, p27, cyclinGl, pS2, NFKBp65, CYP3A4; 

(c) GSTMl, XIAP, Ki67, TS, cyclinGl, p27, CYP3A4, pS2, NFKBp65, 
ErbB3; 

(d) PR, NMEl , XIAP, upa, cyclinGl , Contig51037, TERC, EPHXl , 
ALDH1A3, CTSL; 
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(e) CA9, NMEl, TERC, cyclinGl, EPHXl, DPYD. Src, TOP2B, NFKBp65, 
VEGFC; 

(f) TFRC, XIAP, Ki67, TS, cyclinGl , p27, CYP3A4, pS2, ErbB3, NFKBp65; 

(g) Bcl2, FRAME, cyclinGl, FOXMl, NFKBp65, TS, XIAP, Ki67, CYP3A4, 
p27; 

(h) FOXMl, cyclinGl, XIAP, Contig51037, FRAME, TS, Ki67, PDGFRa, 
p27,NFKBp65; 

(i) FRAME, FOXMl, cyclinGl, XIAP, Contig51037, TS, Ki6, PDGFRa, p27, 
MFKBp65; 

0) Ki67, XIAP, PRAAIE, hENTl, contigS 1037, TS, CD9, p27, ErbB3, cyclinGl ; 
(k) STKl 5, XIAP, FRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1; 
(1) GSTMl, XIAP, FRAME, p27, Contig51037, ErbB3, GSTp, EREG, IDl, 
PLAUR; 

(m) PR, FRAME, NMEl, XIAP, PLAUR, cyclinGl, ContigS 1037, TERC, 
EPHXl, DR5; 

(n) CA9, FOXMl , cyclinGl , XL\P, TS, Ki67, NFKBp65, CYP3 A4, GSTM3, 
p27; 

(o) TFRC, XL\P, FRAME, p27, Contig51037, ErbB3, DPYD, TERC, NMEl, 
VEGFC; and 

(p) CEGPl, FRAME, hENTl , XIAP, Contig51037, ErbB3, DPYD, NFKBp65, 

ID1,TS, 
immobilized on a solid surface. 

45. An array comprising polynucleotides hybridizing to a gene set selected from the 
group consisting of 

(a) FRAME, p27, IGFBP2, HIFl A, TIMP2, ILT2, CYP3 A4, EDI , EstRl , 
DL\BLO; 

(b) ContigS 1 037, EPHXl , Ki67, T1MP2, cyclinGl, DPYD, CYP3 A4, TP, AIBl , 
CYP2C8; 

(c) Bcl2, hENTl, FOXMl, Contig51037, cyclinGl, Contig46653, PTEN, 
CYP3A4, TIMP2, AREG; 

(d) HIFl A, FRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, IDl, EstRl, 
DIABLO; 
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(e) IGFIR, FRAME, EPHXl, Contig51037, cyclinGl, Bcl2, NMEl, PTEN, TBP, 
TIMP2; 

(f) FOXMl, Contig51037, VEGFQ TBP, HIFIA, DPYD, RAD51C, DCR3, 
cyclinGl, BAGl; 

5 (g) EPHXl, Contig51037, Ki67, TIMP2, cyclinGl, DPYD, CYP3A4, TP, AIBl, 

CYP2C8; 

(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl ; 

(i) CDC25B,Contig51037,hENTl,Bcl2,HLAG, TERC, NMEI, upa, ID1,CYP; 
0) VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pinl ; 

10 (k) CTSB, FRAME, p27,IGFBP2, EPHXl, CTSL, BAD, DR5,DCR3,XIAP; 

0) DIABLO, Ki67, hENTl , TIMP2, IDl , p27, KRTl 9, IGFBP2, TS, PDGFB; 
(m) p27, FRAME, IGFBP2, HIFIA, TIMP2, ILT2, CYP3A4, IDl, EstRl, 
DIABLO; 

(n) CDHl ; FRAME, VEGFC; HIFIA; DPYD, TIMP2, CYP3 A4, EstRl , RBP4, 
15 p27; 

(o) IGFBP3, FRAME, p27, BcI2, XIAP, EstRl, Ki67, TS, Src, VEGF; 
(p) GSTM3, FRAME. p27,IGFBP3, XIAP, FGF2, hENTl, PTEN, EstRl, AFC; 
(q) hENTl, Bcl2, FOXMl, Contiig51037, CyclinGl, Contig46653, PTEN, 
CYP3A4, TIMP2, AREG; 
20 (r) STK15, VEGFC, FRAME, p27,GCLC, hENTl, IDl, TIMP2, EstRl, MCPl; 

(s) NMEl, PRAM, p27, IGFBP3, XIAP, PTEN, hENTl, Bcl2, CYP3A4, HLAG; 
^ (t) VDR, Bcl2, p27, hENTl , p53, PI3KC2A, EIF4E, TFRC, MCM3, ID 1 ; 
(u) EIF4E, Contig51037, EPHXl, cyclinGl, Bcl2, DR5, TBP, PTEN, NMEl, 
HER2; 

25 (v) CCNBl, FRAME, VEGFC, HBFl A, hENTl, GCLC, TIMP2, IDl , p27, upa; 

(w) IDl, FRAME, DL^iBLO, hENTl, p27, PDGFRa, NMEl, BINl, BRCAl, TP; 
(X) FBX05, FRAME, IGFBP3, p27, GSTM3, hENTl , XIAP, FGF2, TS, PTEN; 
(y) GUS,HIA1A, VEGFC, GSTM3, DPYD, hENTl, FBX05,CA9,CYP, 

KRTl 8; and 

30 (z) Bclx, BcI2, hENTl, Contlg5 1037, HLAG, CD9, IDl, BRCAl, BINl, 

HBEGF, 
immobilized on a solid surface. 
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Progesterone Receptor Comparison 
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' SEQUENCE LISTING 

<110> GENOMIC HEALTH 
Baker, Joffre B. 
Cronin, Maureen T. 
Kiefer, Michael C. 
Shak, Steve 

Walker, Michael Graham 

<120> GENE EXPRESSION PROFILING IN BIOPSIED TUMOR TISSUES 

<130> 39740-OOOlPCT 

<140> to be assigned 
<141> 2003-03-12 

<150> US 60/412,049 
<151> 2002-09-18 

<150> US 60/364,890 
<151> 2002-03-13 

<160> 384 

<170> FastSEQ for windows Version 4.0 

<210> 1 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gtcccaggag cccatcct 18 

<210> 2 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 2 

cccggctgtt gtctccata 19 

<210> 3 

<211> 68 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gtcccaggag cccatcctgt ttgactgcag cattgctgag aacattgcct atggagacaa 60 
cagccggg 68 

<210> 4 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 4 

tcatggtgcc cgtcaatg 18 

<210> 5 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 5 

cgattgtctt tgctcttcat gtg 23 

Page 1 
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<210> .6 
<211> 79 
<212> DNA 

<213> Homo sapiens 
<400> 6 

tcatggtgcc cgtcaatgct gtgatggcga tgaagaccaa gacgtatcag girggcccaca 60 
tgaagagcaa agacaatcg 79 

<210> 7 
<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 7 

aggggatgac ttggacacat 20 

<210> 8 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 8 

aaaactgcat ggctttgtca 20 

<210> 9 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 9 

aggggatgac ttggacacat ctgccattcg acatgactgc aattttgaca aagccatgca 60 
gtttt .65 

<210> 10 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 10 

tcatcctggc gatctacttc ct 22 

<210> 11 . 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 11 

ccgttgagtg gaatcagcaa 20 

<210> 12 

<211> 91 

<212> DNA 

<213> Homo sapiens 

<400> 12 

tcatcctggc gatctacttc ctctggcaga acctaggtcc ctctgtcctg gctggagtcg 60 
. ctttcatggt cttgctgatt ccactcaacg g 91 

<210> 13 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<4oo> 13 

agcgcctgga atctacaact .20 

Page 2 . 
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<210> 14 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 14 

agagcccctg gagagaagat 20 

<210> 15 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 15 

agcgcctgga atctacaact cggagtccag tgttttccca cttgtcatct tctctccagg 60 
ggctct 66 

<210> 16 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 16 : 

gcccagagaa ggtctatgaa ctca 24 

<210> 17 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 17 

gtttcaaagg cttggtggat tt 22 

<210> 18 

<211> 94 

<212> DNA 

<213> Homo sapiens 

<400> 18 

gcccagagaa ggtctatgaa ctcatgcgag catgttggca gtggaatccc tctgaccggc 60 
cctcctttgc tgaaatccac caagcctttg aaac .94 

<210> 19 
<211> 21 

<212> DNA . , 

<213> Homo sapiens 

<400> 19 . . 

cgcagtgcag ctgagtatct g 21 

<210> 20 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 20 

tgcccagggc tactctcact t 21 

<210> 21 
<211> 80 
<212> DNA 

<213> Homo sapiens 
<400> 21 

cgcagtgcag ctgagtatct gctcagcagt ctaatcaatg gcagcttcct ggtgcgagaa 60 
agtgagagta gccctgggca 80 

<210> 22 

Page 3 
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<211> 21 

<212> DNA 

<213> Homo sapiens 



39740-OOOlPCT . txt 



<400> 22 

cagcagatgt ggatcagcaa g 21 

<210> 23 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 23 . 

gcatttgcgg tggacgat 18 

<210> 24 
<211> 66 
<212> DNA 

<213> Homo sapiens 
<400> 24 

cagcagatgt ggatcagcaa gcaggagtat gacgagtccg gcccctccat cgtccaccgc 60 
aaatgc 66 

<210> 25 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 25 

cgcttctatg gcgctgagat 20 

<210> 26 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 26 

tcccggtaca ccacgttctt 20 

<210> 27 

<211> 71 

<212> DNA 

<213> Homo sapiens 

<400> 27 

cgcttctatg gcgctgagat tgtgtcagcc ctggactacc tgcactcgga gaagaacgtg 60 
gtgtaccggg a . .71 

<210> 28 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 28 

ttgtctctgc cttggactat ctaca 25 

<210> 29 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 29 

ccagcattag attctccaac ttga 24 

<210> 30 
<211> 75 
<212> DNA 

Page 4 
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<213> 


-Homo 


<400> 


30 


ttgtctctgc 


agaatctaat 


<210> 


31 


<211> 


25 


<212> 


DNA 


<213> 


Homo 


<400> 


31 


gaaggagata 


<210> 


32 


<211> 


18 


<212> 


DNA 


<213> 


Homo 


<400> 


32 


cgccacggag 


<210> 


33 


<211> 


74 


<212> 


DNA 


<213> 


Homo 


<400> 


33 


gaaggagata 


ggatctccgt 


<210> 


34 


<211> 


21 


<212> 


DNA 


<213> 


Homo 


<400> 


34 


tggtgaacat 


<210> 


35 


<211> 


22 


<212> 


DNA 


<213> 


Homo 


<400> 


35 


gaaggcgatc 


<210> 


36 


<211> 


80 


<212> 


DNA 


<213> 


Homo 


<400> 


36 


tggtgaacat 


agatcaacaa 


<210> 


37 


<211> 


20 


<212> 


DNA 


<213> 


Homo 


<400> 


37 


ggacagcagg 


<210> 


38 


<211> 


20 


<212> 


DNA 


<213> 


Homo 



75 



25 



18 



74 



21 



22 



80 



20 
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<400> 38 

acccactcga tttgtttctg 20 

<2iO> 39 

<211> 69 

<212> DNA 

<213> Homo sapiens 

<400> 39 

ggacagcagg aatgtgtttc tccatacagg tcacggggag ccaatggttc agaaacaaat 60 
cgagtgggt 69 

<210> 40 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 40 

tgtgagtgaa atgccttcta gtagtga^ 27 

<210> 41 
<211> 27 

<212> DNA . - 

<213> Homo sapiens 

<400> 41 , 

ttgtggttcg ttatcatact cttctga 27 

<210> 42 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 42 

tgtgagtgaa atgccttcta gtagtgaacc gtcctcggga gccgactatg actactcaga 60 
agagtatgat aacgaaccac aa .82 

<210> 43 

<211> 19 . 

<212> DNA 

<213> Homo sapiens 

<400> 43 

gtctcgctcc gtggcctta 19 

<210> 44 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 44 

cgtgagtaaa cctgaatctt tgga 24 

<210> 45 

<211> 93 

<212> DNA 

<213> Homo sapiens 

<400> 45 

gtctcgctcc gtggccttag ctgtgctcgc gctactctct ctttctggcc tggaggctat 60 
ccagcgtact ccaaagattc aggtttactc acg 93 

<210> 46 

<211> 20 . 

<212> DNA 

<213> Homo sapiens 
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<400> 46 

ccattcccac cattctacct 20 

<210> 47 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 47 

gggaacatag acccaccaat 20 

<2lO> 48 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 48 ' 

ccattcccac cattctacct gaggccagga cgtctggggt gtggggattg gtgggtctat 60 
gttccc 66 

<210> 49 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 49 

ccgccgtgga cacagact 18 

<210> 50 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 50 

ttgccgtcag aaaacatgtc a 21 

<210> 51 
<211> 70 
<212> DNA 

<213> Homo sapiens . 

<400> 51 . ' 

ccgccgtgga cacagactcc ccccgagagg tctttttccg agtggcagct gacatgtttt 60 
ctgacggcaa 70 

<210> 52 

<211> 25 . 

<212> DNA 

<213> Homo sapiens 

<400> 52 

cagatggacc tagtacccac tgaga .25 

<210> 53 

<211> 24 

<212>. DNA 

<213> Homo sapiens 

<400> 53 . 

cctatgattt aagggcattt ttcc .24 

<210> 54 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 54 

cagatggacc tagtacccac tgagatttcc acgccgaagg acagcgatgg gaaaaatgcc 60 
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cttaaatcat agg 73 

<210> 55 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 55 ' 

cttttgtgga actctatggg aaca 24 

<210> 56 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 56 

cagcggttga agcgttcct 19 

<210> 57 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 57 

cttttgtgga actctatggg aacaatgcag cagccgagag ccgaaagggc caggaacgct 60 
tcaaccgctg 70 

<210> 58 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 58 

ggatatttcc gtggctctta ttca 24 
<210> 59 

<211> 25 ... 

<212> DNA 

<213> Homo sapiens 

<400> 59 

cttctcatca aggcagaaaa atctt 25 

<210> 60 

<211> 86 

<212> DNA 

<213> Homo sapiens 

<400> 60 

ggatatttcc gtggctctta ttcaaactct ccatcaaatc ctgtaaactc cagagcaaat 60 
caagattttt ctgccttgat gagaag 86 

<210> 61 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 61 

gcagttggaa gacacaggaa agt 23 
<210> 62 

<211> 21 ' 

<212> DNA 

<213> Homo sapiens 

<400> 62 

^gcgtggcac tattttcaag a 21 
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<210> 63 

<211> 77 

<212> DNA 

<213> Homo sapiens 

<400> 63 

gcagttggaa gacacaggaa agtatcccca aattgcagat ttatcaacgg cttttatctt 60 
gaaaatagtg ccacgca 77 

<210> 64 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 64 

tgttttgatt cccgggctta 20 

<210> 65 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 65 

caaagctgtc agctctagca aaag 24 

<210> 66 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 66 

tgttttgatt cccgggctta ccaggtgaga agtgagggag gaagaaggca gtgtcccttt 60 
tgctagagct gacagctttg 80 

<210> 67 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 67 

tcagggggct agaaatctgt 20 

<210> 68 
<211> 20 

<212> DNA - 
<213> Homo sapiens 

<400> 68 

ccattccagt tgatctgtgg 20 

<210> 69 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 69 

tcagggggct agaaatctgt tgctatgggc ccttcaccaa catgcccaca gatcaactgg 60 
aatgg 65 

<210> 70 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 70 ... . 

agttcgtgct ttgcaagatg 20 

<210> 71 
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<211> 20 . ' 

<212> DNA 
<213> Homo sapi ens 

<400> 71 

aaggtaagct gggtctgctg 20 

<210> 72 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 72 

agttcgtgct ttgcaagatg gtgcagagct ttatgaagca gtgaagaatg cagcagaccc 60 
agcttacctt 70 

<210> 73 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 73 

gcatgttcgt ggcctctaag a 21 

<210> 74 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 74 

cggtgtagat gcacagcttc tc 22 

<210> 75 

<211> 69 

<212> DNA 

<213> Homo sapiens 

<400> 75 

gcatgttcgt ggcctctaag atgaaggaga ccatccccct gacggccgag aagctgtgca 60 
tctacaccg 69 

<210> 76 

<211> 20 . 

<212> DNA 

<213> Homo sapiens 

<400> 76 

agatgaagtg gaaggcgctt .20 

<210> 77 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 77 

tgcctctgta atcggcaact g . 21 

<210> 78 

.<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 78 

agatgaagtg gaaggcgctt ttcaccgcgg ccatcctgca ggcacagttg ccgattacag 60 
aggca 65 



<210> 79 
<211> 18 
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<400> 79 ^ 

tggttcccag ccctgtgt .18 

<210> 80 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 80 

ctcctccacc ctgggttgt 19 

<210> 81 

<211> 74. 

<212> DNA 

<213> Homo sapiens 

<400> 81 

tggttcccag ccctgtgtcc acctccaagc ccagattcag attcgagtca tgtacacaac 60 
ccagggtgga ggag 74 

<210> 82 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 82 

tcttgctggc tacgcctctt 20 

<210> 83 

<211> 21 . 

<212> DNA 

<213> Homo sapiens 

<400> 83 

ctgcattgtg gcacagttct g 21 

<210> 84 

<211> 71 . 

<212> DNA 

<213> Homo sapiens 

<400> 84 

tcttgctggc tacgcctctt ctgtccctgt tagacgtcct ccgtccatat cagaactgtg 60 
ccacaatgca g ,71 

<210> 85 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 85 

tgagtgtccc ccggtatctt c 21 

<210> 86 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 86 

cagccgcttt cagattttca t 21 

<210> 87 . 

<211> 81 

<212> DNA 

<213> Homo sapiens 
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<400> 87 

tgagtgtccc ccggtatctt ccccgccctg ccaatcccga tgaaattgga aattttattg 60 
atgaaaatct gaaagcggct g 81 

<210> 88 . 

<211> 21 

<212> DNA 

<213> Homo sapi ens 

<400> 88 

tggagactct cagggtcgaa a .21 

<210> 89 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 89 

ggcgtttgga gtggtagaaa tc 22 

<210> 90 

<211> 65 . 

<212> DNA 

<213> Homo sapiens 

<400> 90 

tggagactct cagggtcgaa aacggcggca gaccagcatg acagatttct accactccaa 60 
acgcc 65 

<210> 91 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 91 

cggtggacca cgaagagtta a 21 

<210> 92 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 92 

ggctcgcctc ttccatgtc 19 

<210> 93 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 93 

cggtggacca cgaagagtta acccgggact tggagaagca ctgcagagac atggaagagg 60 
cgagcc 66 

<210> 94 - 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 94 

gcggaaggtc cctcagaca 19 

<210> 95 
<211> 23 
<212> DNA 

.<213> Homo sapiens 
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<400> 95 

tctaagtttc ccgaggtttc tea 23 

<210> 96 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 96 

gcggaaggtc cctcagacat ccccgattga aagaaccaga gaggctctga gaaacctcgg 60 
gaaacttaga 70 

<210> 97 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 97 

ccagctttgt gcctgtcact at .22 

<210> 98 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 98 

gggaatgtgg tagcccaaga 20 

<210> 99 

<211> 71 

<212> DNA 

<213> Homo sapiens 

<400> 99 

ccagctttgt gcctgtcact attcctcatg ccaccactgc caacacctct gtcttgggct 60 
accacattcc c . 71 

<210> 100 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 100 

ttgctataac taagtgcttc tccaaga 27 

<210> 101 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 101 

gtggaatggc agctcactgt ag 22 

<210> 102 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 102 

ttgctataac taagtgcttc tccaagaccc caactgagtc cccagcacct gctacagtga 60 
gctgccattc cac .73 

<210> 103 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 103 
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aggacgcaag gagggtttg 19 

<210> 104 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 104 

gatgtccgcc gagtccttac t 21 

<210> 105 

<211> 87 

<212> DNA 

<213> Homo sapiens 

<400> 105 

aggacgcaag gagggtttgt cactggcaga ctcgagactg taggcactgc catggcccct 60 
gtgctcagta aggactcggc ggacatc 87 

<210> 106 

<211> 24 

<212> DNA 

<213> Homo sapi ens 

<400> 106 

cuatatgcag ccagagatgt gaca 24 

<210> 107 • 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 107 

ccacgagttt cttactgaga atgg 24 

<210> 108 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 108 

ctatatgcag ccagagatgt gacagccacc gtggacagcc tgccactcat cacagcctcc 60 
attctcagta agaaactcgt gg 82 

<210> 109 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 109 

tgtcgatgga cttccagaac 20 

<210> 110 
<211> 19 

<212> DNA . ^ 

<213> Homo sapiens 

<400> 110 

attgggacag cttggatca 19 

<210> 111 

<211> 62 

<212> DNA 

<c213> Homo sapiens 

<400> 111 

tgtcgatgga cttccagaac cacctgggca gctgccaaaa gtgtgatcca agctgtccca 60 
at 62 
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<210> 112 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 112 

gatctaagat ggcgactgtc gaa 23 

<210> 113 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 113 

ttagattccg ttttctcctc ttctg 25 

<210> 114 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 114 

gatctaagat ggcgactgtc gaaccggaaa ccacccctac tcctaatccc ccgactacag 60 
aagaggagaa aacggaatct aa 82 

<210> 115 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 115 

cggtgtgaga agtgcagcaa 20 

<210> 116 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 116 

cctctcgcaa gtgctccat 19 

<210> 117 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 117 

cggtgtgaga agtgcagcaa gccctgtgcc cgagtgtgct atggtctggg catggagcac 60 

ttgcgagagg .70 

<210> 118 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 118 

cggttatgtc atgccagata cac 23 

<210> 119 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 119 . 

gaactgagac ccactgaaga aagg 24 
<210> 120 
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<211> 81 
<212> DMA 

<213> Homo sapiens, 
<400> 120 

cggttatgtc atgccagata cacacctcaa aggtactccc tcctcccggg aaggcaccct 60 
ttcttcagtg ggtctcagtt c 81 

<210> 121 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 121 

cgtggtgccc ctctatgac 19 

<210> 122 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 122 

ggctagtggg cgcatgtag 19 
<210> 123 

<211> 68 . 

<212> DNA 

<213> Homo sapiens 

<400> 123 

cgtggtgccc ctctatgacc tgctgctgga gatgctggac gcccaccgcc tacatgcgcc 60 
cactagcc 68 

<210> 124 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 124 

tggtccatcg ccagttatca 20 

<210> 125 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 125 

tgttctagcg atcttgcttc aca 23 

<210> 126 

<211> 76 

<212> DNA 

<213> Homo sapiens 

<400> 126 

tggtccatcg ccagttatca catctgtatg cggaacctca aaagagtccc tggtgtgaag 60 
caagatcgct agaaca 76 

<210> 127 

<211> 24 

<212> DNA 

<213> Homo sapi ens 

<400> 127 

catccatgac aactttggta tcgt 24 

<210> 128 
<211> 21 
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<212> DNA 

<213> Homo sapiens 

<400> 128 

cagtcttctg ggtggcagtg a 21 

<210> 129 
<211> 74 
<212> DNA 

<213> Homo sapiens ' 
<400> 129 

catccatgac aactttggta tcgtggaagg actcatgacc acagtccatg ccatcactgc 60 
cacccagaag actg 74 

<210> 130 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 130 

caaaggagct cactgtggtg tct . 23 

<210> 131 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 131 

gagtcagaat ggcttattca cagatg 26 

<210> 132 

<211> 75 

<212> DNA 

<213> Homo sapiens 

<400> 132 

caaaggagct cactgtggtg tctgtgttcc aaccactgaa tctggacccc atctgtgaat 60 
aagccattct gactc 75 

<210> 133 

<211> 20 . 

<212> DNA 

<213> Homo sapiens 

<400> 133 

ccatctgcat ccatcttgtt 20 

<210> 134 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 134 

ggccaccagg gtattatctg 20 

<210> 135 / 

<211> 67 

<212> DNA 

<213> Homo sapiens 

<400> 135 

ccatctgcat ccatcttgtt tgggctcccc acccttgaga agtgcctcag ataataccct 60 
ggtggcc . 67 

<210> 136 
<211> 23 
<212> DNA 
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<213> Homo sapiens 
<400> 136 

cgaaaagatg ctgaacagtg aca 23 

<210> 137 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 137 

tcaggaacag ccaccagtga 20 

<210> 138 
<211> 73 

<212> DNA ' 
<213> Homo sapiens 

<400> 138 

cgaaaagatg ctgaacagtg acaaatccaa ctgaccagaa gggaggagga agctcactgg 60 
tggctgttcc tga 73 

<210> 139 
<211> 20 . 
<212> DNA 

<213> Homo sapiens 
<400> 139 

gagaccctgc tgtcccagaa 20 

<210> 140 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 140 

ggttgtagtc agcgaaggag ate 23 

<210> 141 . 

<211> 76 

<212> DNA 

<213> Homo sapiens 

<400> 141 

gagaccctgc tgtcccagaa ccagggaggc aagaccttca ttgtgggaga ccagatctcc 60 
ttcgctgact acaacc 76 

<210>.142 : . ' . 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 142 

cccactcagt agccaagtca 20 

<210> 143 
<211> 20 

<212> DNA . 
<213> Homo sapiens 

<400> 143 

cacgcaggtg gtatcagtct 20 

<210> 144 . 
<211> 73 . 
<212> DNA 

<213> Homo sapiens 
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<400> 144 

cccactcagt agccaagtca caatgtttgg aaaacagccc gtttacttga gcaagactga 60 
taccacctgc gtg 73 

<210> 145 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 145 

catcaaatgt cagccctgga gttc 24 

<210> 146 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 146 

ttcctgtagg tctttacccc gatagc 26 
<210> 147 

<211> 85 , 

<212> DNA 

<213> Homo sapiens 

<400> 147 

catcaaatgt cagccctgga gttccatgat accacacgaa cacagctttt tgccttcgag 60 
ctatcggggt aaagacctac aggaa 85 

<210> 148 
<211> 24 
<212> DNA 

<213> Homo sapiens . 

<400> 148 . 
tccaggatgt taggaactgt gaag 24 

<210> 149 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 149 

gcgtgtctgc gtagtagctg tt 22 

<210> 150 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 150 

tccaggatgt tagigaactgt gaagatggaa gggcatgaaa ccagcgactg gaacagctac 60 
tacgcagaca cgc 73 

<210> 151 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 151 

aacgactgct actccaagct caa 23 

<210> 152 - 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 152 
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ggatttccat cttgctcacc tt 22 

<210> 153 

<211> 76 

<212> DNA 

<213> Homo sapiens 

<400> 153 

aacgactgct actccaagct caaggagctg gtgcccagca tcccccagaa caagaaggtg 60 
agcaagatgg aaatcc 76 

<210> 154 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 154 ■ 

tccggagctg tgatctaagg a 21 

<210> 155 

<211> 20 - 

<212> DNA 

<213> Homo sapiens 

<400> 155 

cggacagagc gagctgactt 20 

<210> 156 
<211> 76 
<212> DNA 

<213> Homo sapiens i . 

<400> 156 

tccggagctg tgatctaagg aggctggaga tgtattgcgc acccctcaag cctgccaagt 60 
cagctcgctc tgtccg 76 

<210> 157 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 157 

acgcaccggg tgtctga 17 

<210> 158 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 158 

tgccctttct tgatgatgat tatc 24 

<210> .159 

<211> 68 • 

<212> DNA 

<213> Homo sapiens 

<400> 159 

acgcaccggg tgtctgatcc caagttccac cccctccatt caaagataat catcatcaag 60 
aaagggca .68 

<210> 160 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 160 

ccattcaccc tgtgtaacag ga .22 
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<210> 161 . 

<211> Zl 

<212> DNA 

<213> Homo sapiens 

<400> 161 

ccgaccctct aggttaaggc a 21 

<210> 162 

<211> 68 

<212>.DNA 

<213> Homo sapiens 

<400> 162 

ccattcaccc tgtgtaacag gaccccaagg acctgcctcc ccggaagtgc cttaacctag 60 
agggtcgg .68 

<210> 163 
<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 163 

cgtcaggacc caccatgtct 20 

<210> 164 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 164 

ggttaattgg tgacatcctc aaga 24 

<210> 165 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 165 

cgtcaggacc caccatgtct gccccatcac gcggccgaga catggcttgg ccacagctct 60 
tgaggatgtc accaattaac c 81 

<210> 166 
<211> 23 
<212> DNA 

<213> Homo sapi ens . 

<400> 166 

caaacgctga catgtacggt eta 23 

<210> 167 . \ 

<211> 18 

<212> DNA 

<213> Homo sapi ens 

<400> 167 ' 
gctcgttggc gcactctt. 18 

<210> 168 

<211> 88 

<212> DNA 

<213> Homo sapiens 

<400> 168 

caaacgctga catgtacggt ctatgccatt cctcccccgc atcacatcca ctggtattgg 60 
cagttggagg aagagtgcgc caacgagc 88 
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<210> 169 
<211> 25 

. <212> DNA ' 
<213> Homo sapiens 

<400> 169 

gaggcaactg cttatggctt aatta 25 

<210> 170 

<211> 18 

<212> DNA 

<:213> Homo sapiens 

<400> 170 

ggcactcggc ttgagcat 18 

<210> 171 

<211> 75 

<212> DNA 

<213> Homo sapiens 

<400> 171 

gaggcaactg cttatggctt aattaagtca gatgcggcca tgactgtcgc tgtaaagatg 60 
ctcaagccga gtgcc 75 

<210> 172 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 172 

gtccccggga tggatgtt 18 

<210> 173 

<211> 25 

<212> DNA- 

<213> Homo sapiens 

<400> 173 

gatcagtcaa gctgtctgac aattg 25 

<210> 174. 

<211> 79 

<212> DNA 

<213> Homo sapiens 

<400> 174 

gtccccggga tggatgtttt gccaagtcat tgttggataa gcgagatggt agtacaattg 60 = 
tcagacagct tgactgatc .79 

<210> 175 

<211> 21 . 

<212> DNA 

<213> Homo sapiens 

<400> 175 

cgaggattgg ttcttcagca a 21 ; 

<210> 176 

<211> 22 . 

<212> DNA 

<213> Homo sapi ens 

<400> 176 

actctgcacc agctcactgt tg 22 

<210> 177 
<211> 73 
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<212> 


DNA 


<213> 


Homo 


<400> 


177 


cgaggattgg 


gctggtgcag 


<210> 


178 


<211> 


20 


<212> 


DNA 


<213> 


Homo 


<400> 


178 


tcagtggaga 


<210> 


179 


<211> 


20 


<212> 


DNA 


<213> 


Homo 


<400> 


179 


tgccatatcc 


<210> 


180 


<211> 


69 


<212> 


DNA 


<213> 


Homo 


<400> 


180 


tcagtggaga 


gatatggca 


<210> 


181 


<211> 


26 


<212> 


DNA 


<213> 


Homo 


<400> 


181 


gtacaagaga 


<210> 


182 


<211> 


18 


<212> 


DNA 


<213> 


Homo 


<400> 


182 


gtgtagcccg 


<210> 


183 


<211> 


87 


<212> 


DNA 


<213> 


Homo 


<400> 


183 


gtacaagaga. 


gcattaccca 


<210> 


184 


<211> 


22 


<212> 


DNA 


<213> 


Homo 


<400> 


184 


gacatttcca 


<210> 


185 


<211> 


20 


<212> 


DNA 



73 



20 



20 



69 



26 



18 



87 



22 
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<213> Homo sapiens 
<400> 185 

ctccgatcgc acacatttgt 20 

<210> 186 

<211> 86 

<212> DNA 

<213> Homo sapiens 

<400> 186 

gacatttcca gtcctgcagt caatgcctct ctgccccacc ctttgttcag tgtggctggt 60 
gccacgacaa atgtgtgcga tcggag 86 

<210> 187 
<211> 24 . 
<212> DNA 

<213> Homo sapiens 

<400> 187 

gttttggagg aaatgtgttc ttca 24 

<210> 188 
<211> 26 
<212> DNA 

<213> Homo sapiens 

<400> 188 

ttctctaata cactgccgtc ttaagg 26 

<210> 189 
<211> 101 
<212> DNA 
<213>..Homo sapiens 

<400> 189 

gttttggagg aaatgtgttc ttcagtgcac agaatgcagc aaaacagcca tctgataaat 60 
gctctgcaag ccctccctta agacggcagt gtattagaga a 101 

<210> 190 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 190 

acgagaacga gggcatctat gt 22 

<210> 191 - 

<211> 22 . . , 

<212> DNA 

<213> Homo sapiens 

<400> 191 

gcatgtaggt gcttccaatc ac 22 

<210> 192 

<211> 75 

<212> DNA 

<213> Homo sapiens 

<400> 192 

acgagaacga gggcatctat gtgcaggatg tcaagaccgg aciaggtgcgc gctgtgattg 60 
gaagcaccta catgc 75 

<210> 193 

<211> 21 

<212> DNA 

<213> Homo sapiens 
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<400> 193 

tccctccact cggaaggact a 21 

<210> 194 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 194 

cggttgttgc tgatctgtct ca 22 

<210> 195 

<211> 84 

<212> DNA ^ 

<213> Homo sapiens 

<400> 195 

tccctccact cggaaggact atcctgctgc caagagggtc aagttggaca gtgtcagagt 60 
cctgagacag atcagcaaca accg 84 

<210> 196 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 196 . 

ttgttggtgt gccctggtg 19 

<210> 197 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 197 

tgggttctgt ccaaacactg g 21 

<210> 198 

<211> 67 

<212> DNA 

<213> Homo sapiens 

<400> 198 ' 
ttgttggtgt gccctggtgc cgtggtggcg gtcactccct ctgctgccag tgtttggaca 60 
gaaccca 67 

. <210> 199 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 199 

actgaaggag acccttggag 20 

<210> 200 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 200 

taaataaccc tgcccacaca 20 

<210> 201 • 
<211> 62 . 
<212> DNA 
<213> Homo sapiens 

<400> 201 
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actgaaggag acccttggag cctaggggca tcggcaggag agtgtgtggg cagggttatt 60 
ta 62 



<210> 


202 


<211> 


28 


<212> 


DNA 


<213> 


Homo 


<400> 


202 


agttactaaa 


<210> 


203 


<211> 


21 


<212> 


DNA 


<213> 


Homo 


<400> 


203 


gtcggtgagt 


<210> 


204 


<211> 


79 


<212> 


DNA 


<213> 


Homo 


<400> 


204 


agttactaaa 


gcacaaatca 


<210> 


205 


<211> 


20 


<212> 


DNA 


<213> 


Homo 


<400> 


205 


gggagtttcc 


<210> 


206 


<211> 


20 


<212> 


DNA 


<213> 


Homo 


<400> 


206 


cttcaaccac 


<210> 


207 


<211> 


72 


<212> 


DNA 


<213> 


Homo 


<400> 


207 


gggagtttcc 


aggtggttga 


<210> 


208 


<211> 


23 


<212> 


DNA 


<213> 


Homo 


<400> 


208 


aggtgtcatc 


<210> 


209 


<211> 


20 


<212> 


DNA 


<213> 


Homo 


<400> 


209 


tcccgatcac 



28 



21 



79 



20 



20 



72 



23 



20 
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<210> 210 
<211> 90 
<212> DNA 

<213> Homo sapiens 
<400> 210 

aggtgtcatc catcaacgtc tctgtgaacg cagtgcagac tgtggtccgc cagggtgaga 60 
acatcaccct catgtgcatt gtgatcggga - 90 

<210> 211 

<211> 24 

<212> DMA 

<213> Homo sapiens 

<400> 211 

agagccagtt gctgtagaac tcaa 24 

<210> 212 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 212 

ctgggcctac acagtccttc a 21 

<210> 213 

<211> 74 

<212> DNA 

<213> Homo sapiens 

<400> 213 

agagccagtt gctgtagaac tcaaatctct gctgggcaag gatgttctgt tcttgaagga 60 
ctgtgtaggc ccag 74 

<210> 214 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 214 , ' 

gaaatgactg catcgttgat aaaatc 26 

<210> 215 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 215 

tgccagcctg acagcactt 19 

<210> 216 
<211> 78 
<212> DNA 

<213> Homo sapiens 
<400> 216 

gaaatgactg catcgttgat aaaatccgca gaaaaaactg cccagcatgt cgccttagaa 60 
agtgctgtca ggctggca 78 

<210> 217 
<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 217 ' * ' ' 

gatcaacggc tacatccaga .20 
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<210> 218 " 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 218 

tgaactgtga ggccagagac 20 

<210> 219 

<211> 68 

<212> DNA 

<213> Honio sapiens 

<400> 219 

gatcaacggc tacatccaga agatcaagtc gggagaggag gactttgagt ctctggcctc 60 
acagttca 68 

<210> 220 

<2ii> 19 : 

<212> DNA 

<213> Homo sapiens 

<400> 220 

gtggatgtgc cctgaagga 19 

<210> 221 

<211> 20 V 

<212> DNA 

<213> Homo sapiens 

<400> 221 

ctgcggatcc agggtaagaa 20 

<210> 222 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 222 

gtggatgtgc cctgaaggac aagccaggcg tctacacgag agtctcacac ttcttaccct 60 
ggatccgcag 70 

<210> 223 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 223 

tggacttcta gtgatgagaa agattga 27 

<210> 224 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 224 

cactgcgaga tcaccacagg ta 22 

<210> 225 

<211> 84 

<212> DNA 

<213> Homo sapiens 

<400> 225 

tggacttcta gtgatgagaa agattgagaa tgttcccaca ggccccaaca ataagcccaa 60 
gctacctgtg gtgatctcgc agtg 84 

<210> 226 
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<211> 25 • 
<212> DMA 
<213> Homo sapiens 

<400> 226 

tggctaagtg aagatgacaa tcatg 25 

<210> 227 

<211> 25- 

<212> DNA 

<215> Homo sapiens 

<400> 227 

tgcacatatc attacaccag ttcgt 25 

<210> 228 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 228 

tggctaagtg aagatgacaa tcatgttgca gcaattcact . gtaaagctgg aaagggacga 60 
actggtgtaa tgatatgtgc a .81 

<210> 229 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 229 

tctgcagagt tggaagcact eta ^ 23 

<210> 230 

<211> 21 

<212> DNA 

<213>. Homo sapiens 

<400> 230 

gccgaggctt ttctaccaga a 21 

<210> 231 

<211> 79 

<212> DNA 

<213> Homo sapiens 

<400> 231 " 
tctgcagagt tggaagcact ctatggtgac atcgatgctg tggagctgta tcctgccctt 60 
ctggtagaaa agcctcggc .79 

<210> 232 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 232 

acgacacgta tgccgtacag tact 24 

<210> 233 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 233 . ^ 

ccgggaaaac acgaagga 18 



<210> 234 
<211> 86 
<212> DNA 
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<213> Homo sapiens 
<400> 234 

acgacacgta tgccgtacag tactcctgcc gcctcctgaa cctcgatggc acctgtgctg 60 
acagctactc cttcgtgttt tcccgg 86 

<210> 235 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 235 

ctgccgggat ggcttctat 19 

<210> 236 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 236 

ccaggttctg gaaactgtgg at , 22 

<210> 237 
<211> 68 

<212> DNA , 
<213> Homo sapiens 

<400> 237 

ctgccgggat ggcttctatg aggctgagct ctgcccggac cgctgcatcc acagtttcca 60 
gaacctgg 68 

<210> 238 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 238 

ccacaagctg aaggcagaca 20 
<210> 239 

<211> 21 , 

<212> DNA 

<213> Homo sapiens 

<400> 239 

gcgtgcttcc ttggtcttag a .21 

<210> 240 
<211> 85 
<212> DNA 

<213> Homo sapiens . 
<400> 240 

ccacaagctg aaggcagaca aggcccgcaa. gaagctcctg gctgaccagg ctgaggcccg 60 
caggtctaag accaaggaag cacgc 85 

<210> 241 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 241 

ccattctatc atcaacgggt acaa 24 

<210> 242 
<211> 23 

<212> DNA . ■ ■ ' 

<213> Homo sapiens 
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<400> 242 

tcagcaagtg ggaaggtgta ate 23 

<210> 243 

<211> 75 

<212> DNA 

<213> Homo sapiens 

<400> 243 

ccattctatc atcaacgggt acaaacgagt cctggccttg tctgtggaga cggattacac 60 
cttcccactt gctga 75 

<210> 244 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 244 

tatcgaggca ggtcatacca 20 

<210> 245 . . 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 245 

taacgcttgg catcatcatt 20 

<210> 246 

<211> 74 

<212> DNA 

<213> Homo sapiens 

<400> 246 . 

tatcgaggca ggtcatacca tgaccggaag tcaaaagttg acctggatag gctcaatgat 60 
gatgccaagc gtta 74 

<210> 247 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 247 

ccgcaacgtg gttttctca .19 

<210> 248 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 248 

tgctgggttt ctcctcctgt t 21 

<210> 249 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 249 

ccgcaacgtg gttttctcac cctatggggt ggcctcggtg ttggccatgc tccagctgac 60 
aacaggagga gaaacccagc a 81 

<210> 250 

<211> 25 ... 

<212> DNA 

<213> Homo sapiens 
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<400> 250 ^ 
tcaagaccat catcactttc attgt 25 

<210> 251 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 251 

ggatcaggaa gtacacggag tataact 27 

<210> 252 

<211> 96 

<212> DNA 

<213> Homo sapiens 

<400> 25.2 ' 
tcaagaccat catcactttc attgtctcgg acgtgcgggg cctgggcctc ccggtccgca 60 
agcagttcca gttatactcc gtgtacttcc tgatcc . 96 

<210> 253 
<211> 19 
<212> DNA 

<213> Homo sapiens 

<400> 253 

gcccgaaacg ccgaatata 19 

<210> 254 ' 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 254 

cgtggctctc ttatcctcat gat 23 

<210> 255 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 255 

gcccgaaacg ccgaatataa tcccaagcgg tttgctgcgg taatcatgag gataagagag 60 
ccacg 65 

<210> 256 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 256 

gccctcccag tgtgcaaat 19 
<210> 257 

<211> 25 ■ 

<212> DNA . 

<213> Homo sapiens 

<400> 257 

cgtcgatggt attaggatag aagca 25 

<210> 258 • 
<211> 86 

<212> DNA ' • 

<213> Homo sapiens 
<400> 258 

gccctcccag tgtgcaaata agggctgctg tttcgacgac accgttcgtg gggtcccctg 60 
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gtgcttctat cctaatacca tcgacg 86 

<210> 259 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 259 

caagctagat cagcattctc taacttg 27 

<210> 260 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 260 

cacatgactg ttatcgccat ctact 25 

<210> 261 

<211> 99 

<212> DNA 

<213> Homo sapiens 

<400> 261 

caagctagat cagcattctc taacttgttt ggtggagaac cattgtcata tacccggttc 60 
agcctggctc ggcaagtaga tggcgataac agtcatgtg 99 

<210> 262 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 262 

cacaggaaca acagcatctt tc 22 

<210> 263 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 263 

agataagccc ctgggatcca 20 

<210> 264 

<211> 75 

<212> DNA 

<213> Homo sapiens 

<400> 264 

cacaggaaca acagcatctt tcaccaagat gggtggcacc aaccttgctg ggacttggat 60 
cccaggggct tatct 75 

<210> 265 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 265 

ggattgctca acaaccatgc t 21 

<210> 266 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 266 

ggcattaaca cttttggacg ataa 24 
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<210> 267 
<211> 91 
<212> DNA 

<213> Homo sapiens , 
<400> 267 

ggattgctca acaaccatgc tgggcatctg gaccctccta cctctggttc ttacgtctgt 60 
tgctagatta tcgtccaaaa gtgttaatgc c 91 

<210> 268 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 268 

gcactttggg attctttcca ttat 24 

<210> 269 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 269 

gcatgtaaga agaccctcac tgaa 24 

<210> 270 

<211> 80 ... 

<212> DNA 

<213> Homo sapiens 

<400> 270 

gcactttggg attctttcca ttatgattct ttgttacagg caccgagaat gttgtattca 60 
gtgagggtct tcttacatgc 80 

<210> 271 ' 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 271 

aatccaaggg ggagagtgat 20 

<210> 272 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 272 

gtacagattt tgcccgagga 20 

<210> 273 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 273 

aatccaaggg ggagagtgat gacttccata tggactttga ctcagctgtg gctcctcggg 60 
caaaatctgt ac 72 

<210> 274 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 274 

tgtggacatc ttcccctcag a 
<210> 275 



21 



Page 34 



wo 2003/078662 



PCT/US2003/007713 



39740-000lPCT.txt 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 275 

ctagcccgac cggttcgt 18 

<210> 276 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 276 

tgtggacatc ttcccctcag acttccctac tgagccacct tctctgccac gaaccggtcg 60 
ggctag 66 

<210> 277 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 277 

ctttgaaccc ttgcttgcaa . 20 

<210> 278 
<211> 18 
.<212> DNA 
<213> Homo sapiens 

<400> 278 

cccgggacaa agcaaatg 18 

<210> 279 

<211> 68 

<212> DNA 

<213> Homo sapiens 

<400> 279 

ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac ccaggacttc catttgcttt 60 
gtcccggg 68 

<210> 280 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 280 

gcctcggtgt gcctttca 18 

<210> 281 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 281 

cgtgatgtgc gcaatcatg 19 

<210> 282 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 282 

gcctcggtgt gcctttcaac atcgccagct acgccctgct cacgtacatg attgcgcaca 60 
tcacg 65 

<210> 283 
<211> 20 
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<212> DNA 

<213> Homo sapiens 
<400> 283 

ctgctgtctt gggtgcattg 20 

<210> 284 . 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 284 

gcagcctggg accacttg 18 

<210> 285 

<211> 71 

<212> DNA 

<213> Homo sapiens 

<400> 285 

ctgctgtctt gggtgcattg gagccttgcc ttgctgctct acctccacca tgccaagtgg 60 
tcccaggctg c 71 

<210> 286 
<211> 1947 
<212> DNA 

<213> Homo sapiens 
<400> 286 

ttttccccag atatggggtt ctattcagcc atagataatc tagacagagg atttcagaat 60 
gaaaggaaaa atgtgtggag attagtccta gttcattctg agggccgact aagtggctca 120 
gccagcttct tactccatct gcagttcata ctgccaaaga gctcccactt ccaaatcccc 180 
agtgacttta tggagaagat tctgcattaa attgtctttc gaatgatggg gaagcaaggc 240 
ataatatgcg atgatgagga gaaagtagac cagtgaggtg attgcaagac taacaaggag 300 
actcaatggg aagtttttct ttcttttaga tattgctttt gaagtagatg gtaaaatttt 360 
tgtcatcctt cttgtatttt ttgtacccca agttacaatt tttcttcttc cttgtaaata 420 
atttaaacag tatttatttt tgtaaggcat aactagaaac taaaatatat tctaaaaaat 480 
tcattattct gaacaaagtg atcaaattag aatacatatt tttcaacagt ggtagagctt 540 
ttaatatatg tttattgaaa gttatctata atacttgcac cagtgttgaa aaaagttaac 600 
atgtaggcaa gagcaatatg tttgtctcaa ggatttttcc atggtttcct cagtgatggt 660 
gtcctggaat tattcaggtg gtgaccatca ctggtctaag tttgtgtgca gggttttcag 720 . 
acgtgttttt gtgaaacttg gtagaaccat ggctaataaa gaggacagtg ttgtcagggt 780 
ccatctgccc tccatagaaa aatgtctctg gctcataaaa tgagactccc tcagggacta 840 
aatatgaact gacagcagta actctgatac agaataatct aaattgcatc aaatggcctt 900 
aattcagagt ttgttaggct tatcagtatg ttgcttttaa ttggggtggg aaagtagagg 960 
gagagaaagc aagacattta ttaagcacct cgtatgtgcc aggcactatg ctaagcactt 1020 
tacataagtt aggattaatc cctgcaagaa tcctataaag aatgttacta gcatttacac 1080 
ttcccaaatg aaggtaccaa agctcaaacg caatgttgtg aagctgtttc cttcagattt 1140 
aggttatgtg ggatgatgtg ggattgaaga ggaaagaaag gtgggattat ccccctagga 1200 
agactttcag gcctgacttc ataggaattc atccatctta tcatgtggag tttatctcac 1260 
cctgctgttg caggatgcta tttgcatgtg tccccaggtg atgttttttc tttggggagt 1320 
aggggtttgg cttcctcatt catccctctt gctaaaagag gagatagttg atgttgcatc 1380 
taaagatgct ataagacaat gaaagtttga tgttgtacat acctacaagt accatttttg 1440 
tgcatgatta cactccactg acatcttcca agtactgcat gtgattgaat aagaaacaag 1500 
aaagtgacca caccaaagcc tccctggctg gtgtacaggg atcaggtcca cagtggtaca 1560 
gattcaacca ccacccaggg agtgcttgca gactctgcat agatgttgct gcatgcgtcc 1620 
catgtgcctg tcagaatggc agtgtttaat tctcttgaaa gaaagttatt tgctcactat 1680 
ccccagcctc aaggagccaa ggaagagtca ttcacatgga aggtccgggt ctggtcagcc 1740 
actctgactt ttctaccaca ttaaattctc cattacatct cactattggt aatggcttaa 1800 
gtgtaaagag ccatgatgtg tatattaagc tatgtgccac atatttattt ttagactctc 1860 
cacagcattc atgtcaatat gggattaatg cctaaacttt gtaaatattg tacagtttgt 1920 
aaatcaatga ataaaggttt tgagtgt 1947 

<210> 287 

<211> 1311 

<212> DNA 

<213> Homo sapiens 
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<400> 287 

tagtcgggcg 

cgggcctggc 

cccggctgcg 

gtggtccgcc 

ccaccagggg 

gctcgacccg 

cctggagtga 

gccaggaggt 

tggcggcagc 

ttacctccca 

aggtcatagg 

aaatggaaac 

ggaaaaagaa 

ctgtggagaa 

agggttttct 

aagccacaat 

aaaatttcaa 

tagccgagtg 

caaactttgc 

gcgccaccag 

ggcaactggc 

ctttgtgatt 



39740-000lPCT.txt 



gggttgtgag 
tcagcgcggg 
cgcccttcgg 
tccctctcgg 
cgccgccgcc 
gagcgaggag 
agaggcgacc 
gacccgggac 
tgggctcacc 
gcagggcagc 
ggttccacag 
accgttgtca 
cagtccacag 
gatagctgac 
gcccaaggat 
agagcagttt 
agacagtaga 
tgacacagtg 
cctggccgag 
ctctgccgtc 
catttgccaa 
tgagtaaagc 



acgccgcgct 
ggggcgcgga 
ccaggccggg 
cgtccacctg 
ggcgctcgca 
ttgacccgga 
cagagtgagg 
gaggagtcga 
gtgactgtca 
agtgaaccag 
tcttttcaga 
gcacttggaa 
gaagaggttg 
cagctggaag 
ttgcaagctg 
atgaagatct 
ttgaaaagga 
gagcagaaca 
tgaggtgtag 
tctggatcgg 
ttttcctact 
tcctattctg 



cagcttccat 
gaccgcgagg 
agccgcgcca 
cccggagtac 
ggccgcggat 
gcgaggagtt 
aggcgaccca 
cccggagcga 
cccacagcaa 
ttgtccaaga 
aactcatatt 
tacaagatgg 
aactaaagaa 
agttgaataa 
aagctctctg 
tggaggagat 
aaggcttggt 
tctgccagga 
cagaaaaagg 
aatttacctg 
ctcacactgg 
tttttcacaa 



cgctgggcgg 
cgaccgggag 
gtcggagccc 
tgccagcggg 
gaagaagaaa 
gaccctgagt 
gggcgaagag 
ggaggtgacc 
tgagaagcac 
cctggcccag 
taagggaaaa 
ttgccgggtc 
gttgaaacat 
agagcttact 
caaacttgat 
tgacacactg 
aaaaaaggtt 
gactgagcgg 
ctgtgctgcc 
atttcttcag 
ttctcaatga 
aaaaaaaaaa 



tcaacaagtg 
cggctgggtt 
ccggcccagc 
catgaccgac 
acccggcgcc 
gaggaagcga 
atgaatcgga 
agggaggaaa 
gaccttcatg 
gttgttgaag 
tctctgaagg 
atgttaattg 
ttggagaagt 
ggaatccagc 
aggagagtaa 
atcctgccag 
caggcattcc 
ctgcagtcta 
ctgaagaatg 
ggctgctggg 
aaaatagtgt 
a 



60 

120 

180 . 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1311 



<210> 288 
<211> 582 
<212> DNA 
<213> Homo sapi ens 



<400> 288 

atggcccgcg 

ggcccgcgcc 

gagcccggcc 

gcccccaccg 

ccccgcagcc 

gagcagcacc 

caggcggccc 

ctgcggcgga 

cagcggcacc 

ttacccaggg 



cacgccagga 
ccttcccgct 
tggctgccgc 
ccccacccgc 
ggccccgagg 
tggagtcgcc 
cgggagtccg 
tggcggacga 
gcccctcacc 
gccacagagc 



gggcagctGc 
cggccgcctg 
ccccgccgcc 
cgtcaccgcc 
cccgcgcccg 
cgtgcccagc 
cggggaggag 
cctcaacgca 
ctggagggtc 
ccccgagatg 



ccggagcccg 
gtgccctcgg 
cccaccctgc 
gccctggggg 
gacggtcctc 
gccccggggg 
gaacagtggg 
cagtacgagc 
ctgtacaatc 
gagcccaatt 



tagagggcct 
cagtgtcctg 
tgcccgctgc 
gttcccgctg 
agccctcgct 
ctctggcggg 
cccgggagat 
ggcggagaca 
tcatcatggg 
ag 



ggcccgcgac 60 
cggcctctgc 120 
ctacctctgc 180 
gcctgggggt 240 
ctcgctggcg 300 
cggtcccacc 360 
cggggcccag 420 
agaggagcag 480 
actcctgccc 540 
582 



<210> 289 

<211> 6030 

<212> DNA 

<213> Homo sapiens 



<400> 289 

gttggccccc 

taaccgggag 

ggatgcggga 

ctcgcagccc 

gccgctgcag 

acctgtggtc 

cgacttcgcc 

tgccacggtg 

ctttgagttc 

ggacaacatc 

ggataacgga 

tgatttctcc 

caccctgggt 

caaaaggttc 

agctgcaggc 

cacacacaca 

aagggaaata 

gtcccatcaa 

tggctccacc 

gatcaccatc 

tctggctgct 



gttacttttc 
atagtgatga 
gatgtgggcg 
gggcacacgc 
accccggctg 
cacctgaccc 
gagatgtcca 
gtggaggagc 
ggtggggtca 
gccctgtgga 
ggctgggatg 
tggctgtctc 
gcctatctgg 
actaaagcag 
tgtttaagaa 
caacaattaa 
tcatttattt 
aactcctgtc 
tggatgttct 
tgaagagcag 
ggaggctggg 



ctctgggaaa 
agtacatcca 
ccgcgccccc 
cccatacagc 
cccccggcgc 
tccgccaggc 
ggcagctgca 
tcttcaggga 
tgtgtgtgga 
tgactgagta 
cctttgtgga 
tgaagactct 
gccacaagtg 
tagaaataat 
aaaataacac 
cagtcttcag 
tttacattat 
tttggaaatc 
gtgcctgtaa 
acggatggaa 
gagaaggtgt 



tatggcgcac 
ttataagctg 

gggggccgcc 

cgcatcccgg 
cgccgcgggg 
cggcgacgac 
cctgacgccc 
cggggtgaac 
gagcgtcaac 
cctgaaccgg 
actgtacggc 
gctcagtttg 
aagtcaacat 
atgcattgtc 
acatataaac 
gcaaaacgtc 
taagaaaaaa 
cgaccactaa 
acatagattc 
aaaggacctg 
tcattcactt 
Page 3; 



gctgggagaa 
tcgcagaggg 
cccgcgccgg 
gacccggtcg 
cctgcgctca 
ttctcccgcc 
ttcaccgcgc 
tgggggagga 
cgggagatgt 
cacctgcaca 
cccagcatgc 
gccctggtgg 
gcctgcccca 
agtgatgttc 
atcacacaca 
gaatcagcta 
agatttattt 
ttgccaagca 
gctttccatg 
atcattgggg 
gcatttcttt 



cagggtacga 
gctacgagtg 
gcatcttctc 
ccaggacctc 
gcccggtgcc 
gctaccgccg 
ggggacgctt 
ttgtggcctt 
cgcccctggt 
cctggatcca 
ggcctctgtt 
gagcttgcat 
aacaaatatg 
catgaaacaa 
cagacagaca 
tttactgcca 
atttaagaca 
ccgcttcgtg 
ttgttggccg 
aagctggctt 
gccctggggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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ctgtgatatt aacagaggga gggttcctgt 
gaagagactc tttgcatatg actcacatga 
aacttcagat ggacctagta cccactgaga 
atgcccttaa atcataggaa agtatttttt 
ttagcaattt atacaatatc atccagtacc 
ttggatacgc accccccaac tcccaatact 
tggaacttga ggaagtgaac atttcggtga 
gagcatcagg ccgccacaag tgcctgcttt 
tgtcccagct tggaggcctg gtcctggaac 
ggatgatcaa cagggcagtg tggtctccga 
cactgtcaag aaagagcagt agaggggtgt 
gtaggcccgt tttcacgtgg agcatgggag 
tagagggaag gaacagaggc cctgggccct 
. aacgtgagga gaggcaatgg ccacggccca 
tgtggccttg gcccacctgt gagtttaaag 
caaatcctaa aagaagcatt gaagtgaggt 
attacatgta aaacattatc ttgtcactgt 
aaaaagttcc aggtgtggaa tatgggggtt 
atcaatggtg gggaactata aagaagtaac 
aggaaatttt tttttcttcc agtttagaat 
tggcattatt gcattatata ccatttatct 
gtttaatgct gtggttgata tttcgaaagc 
ttttttgttt ttaattgtat ttagttatgg 
gttttctgtt tgagattttt atctcttgat 
agccctgagt ctcagctacc taagaaaaac 
tttcaaccaa gtcatgtgca tttccacgtc 
tgttgtccct ttgaccttgt ttcttgaagg 
attcatggta ttcaggatta catgcatgtt 
aaatccagat ggcaaatgac cagcagattc 
tgctttacgt ggcctgtttc aacacagacc 
ggggctttct catggctgtc cttcagggtc 
ccaagaaagc aggaaacctg tggtatgaag 
gaatgatcag acctttgaat gattctaatt 
tacattgtca aagtgatgaa tatggaatat 
attttaatgg agtcagtttg cagtatgctc 
aagtaacaat gaagaacgtg gacgctttta 
ttcaaacggg attcacagag tatttgaaaa 
aattgctggc tggctgcctt ttgctgtggg 
tgtgcccagc ctcttggccc cagaactgta 
agttgatgtt gcattttcct tattgttaaa 
agcctcaact agtcattttt ttctcctctt 
agttgggcaa cagagaacca tccctatttt 
aactgctctt tatgaatgaa aaaacagtcc 
ggtcagagtt aaatagagta tatgcacttt 
ccccaaaagg agaagaacat ctgagaacct 
atactccgca agagaggcca gaatgacagc 
cgaagatttg gcaggggcag aaaactctgg 
atcaaggaag cacctcaatt tagttcaaac 
tacctctctg tgttcagatg tggccttcca 
tgcttatcat ctaaagatgt agctctggcc 
tcgagaggag ttataataat caagattaaa 
tctagctttc acctccagga tctattgagt 
attgaactta tcctaaaaca aatagtttat 
actgtacttt taaggcagtg gctgttttta 
acacctactc tatcagagaa aaacaggaaa 
tagggagtca gttgaaattc tattctgatc 
aaatgtggtt acacactttt taagaaatac 
caatcagatc tttattgtta ttcaatttgg 
atgggacaaa ggacatttgt tggaggggtg 
attcccaagt ttggatcagg gagttggaag 
aggacctgta ttggggtcga tgtgatgcct 
acattttgaa gtttgtggta cgacctttag 
cagctccgtt tggcagtgca atggtataaa 
aaacaataaa tgtgcagttt taactaacag 
ggacatctgt ttctaaatgt ttattatgta 
caatgtgaaa ctgaattgga gagtgataat 
ttctgttcca tgtctttgga caaccatgac 
ggatgcaaag aaaatcagat ggagcatgaa 



39740~000lPCT.txt 

ggggggaagt ccatgcctcc ctggcctgaa 1320 
tgcatacctg gtgggaggaa aagagttggg 1380 
tttccacgcc gaaggacagc gatgggaaaa 1440 
taagctacca attgtgccga gaaaagcatt 1500 
ttaagccctg attgtgtata ttcatatatt 1560 
ggctctgtct gagtaagaaa cagaatcctc 1620 
cttccgcatc aggaaggcta gagttaccca 1680 
taggagaccg aagtccgcag aacctgcctg 1740 
tgagccgggg ccctcactgg cctcctccag 1800 
atgtctggaa gctgatggag ctcagaattc 1860 
ggctgggcct gtcaccctgg ggccctccag 1920 
ccacgaccct tcttaagaca tgtatcactg 1980 
tcctatcaga aggacatggt gaaggctggg 2040 
ttttggctgt agcacatggc acgttggctg 2100 
caaggcttta aatgactttg gagagggtca 2160 
gtcatggatt aattgacccc tgtctatgga 2220 
agtttggttt tatttgaaaa cctgacaaaa 2280 
atctgtacat cctggggcat taaaaaaaaa 2340 
aaaagaagtg acatcttcag caaataaact 2400 
cagccttgaa acattgatgg aataactctg 2460 
gtattaactt tggaatgtac tctgttcaat 2520 
tgctttaaaa aaatacatgc atctcagcgt 2580 
cctatacact atttgtgagc aaaggtgatc 2640 
tcttcaaaag cattctgaga aggtgagata 2700 
ctggatgtca ctggccactg aggagctttg 2760 
aacagaattg tttattgtga cagttatatc 2820 
tttcctcgtc cctgggcaat tccgcattta 2880 
tggttaaacc catgagattc attcagttaa 2940 
aaatctatgg tggtttgacc tttagagagt 3000 
cacccagagc cctcctgccc tccttccgcg 3060 
ttcctgaaat gcagtggtgc ttacgctcca 3120 
ccagacctcc ccggcgggcc tcagggaaca 3180 
tttaagcaaa atattatttt atgaaaggtt 3240 
ccaatcctgt gctgctatcc tgccaaaatc 3300 
cacgtggtaa gatcctccaa gctgctttag 3360 
atataaagcc tgttttgtct tctgttgttg 3420 
atgtatatat attaagaggt cacgggggct 3480 
gttttgttac ctggttttaa taacagtaaa 3540 
cagtattgtg gctgcacttg ctctaagagt 3600 
aacatgttag aagcaatgaa tgtatataaa 3660 
cttttttttc attatatcta attattttgc 3720 
gtattgaaga gggattcaca tctgcatctt 3780 
tctgtatgta ctcctcttta cactggccag 3840 
ccaaattggg gacaagggct ctaaaaaaag 3900 
cctcggccct cccagtccct cgctgcacaa 3960 
tgacagggtc tatggccatc gggtcgtctc 4020 
caggcttaag atttggaata aagtcacaga 4080 
aagacgccaa cattctctcc acagctcact 4140 
tttatatgtg atctttgttt tattagtaaa 4200 
cagtgggaaa aattaggaag tgattataaa 4260 
tgtaaataat cagggcaatc ccaacacatg 4320 
gaacagaatt gcaaatagtc tctatttgta 4380 
aaatgtgaac ttaaactcta attaattcca 4440 
gactttctta tcacttatag ttagtaatgt 4500 
ggctcgaaat acaagccatt ctaaggaaat 4560 
ttattctgtg gtgtcttttg cagcccagac 4620 
aattctacat tgtcaagctt atgaaggttc 4680 
atctttcagg gatttttttt ttaaattatt 4740 
ggagggagga acaattttta aatataaaac 4800. 
ttttcagaat aaccagaact aagggtatga 4860 
ctgcgaagaa ccttgtgtga caaatgagaa 4920 
attccagaga catcagcatg gctcaaagtg 4980 
tttcaagctg gatatgtcta atgggtattt 5040 
gatatttaat gacaaccttc tggttggtag 5100 
caatacagaa aaaaatttta taaaattaag 5160 
acaagtcctt tagtcttacc cagtgaatca 5220 
cttggacaat catgaaatat gcatctcact 5280 
tggtactgta ccggttcatc tggactgccc 5340 
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cagaaaaata acttcaagca aacatcctat caacaacaag gttgttctgc ataccaagct 5400 
gagcacagaa gatgggaaca ctggtggagg atggaaaggc tcgctcaatc aagaaaattc 5460 
tgagactatt aataaataag actgtagtgt agatactgag taaatccatg cacctaaacc 5520 
ttttggaaaa tctgccgtgg gccctccaga tagctcattt cattaagttt ttccctccaa 5580 
ggtagaattt gcaagagtga cagtggattg catttctttt ggggaagctt tcttttggtg 5640 
gttttgttta ttataccttc ttaagttttc aaccaaggtt tgcttttgtt ttgagttact 5700 
ggggttattt ttgttttaaa taaaaataag tgtacaataa gtgtttttgt attgaaagct 5760 
tttgttatca agattttcat acttttacct tccatggctc tttttaagat tgatactttt 5820 
aagaggtggc tgatattctg caacactgta cacataaaaa atacggtaag gatactttac 5880 
atggttaagg taaagtaagt ctccagttgg ccaccattag ctataatggc actttgtttg 5940 
tgttgttgga aaaagtcaca ttgccattaa actttccttg tctgtctagt taatattgtg 6000 
aagaaaaata aagtacagtg tgagatactg 6030 



<210> 290 
<211> 10987 
<212> DNA 

<213> Homo sapiens 



<400> 290 
ggtggcgcga 
tctgctgcgc 
acagatttgt 
ctggagcgga 
tccaaagaga 
ggaccaataa 
gaacctgcag 
. ccacaaagga 
gggctgactc 
ttaggaagga 
gatcaagcag 
gttctacaat 
tttcatacac 
ggagctgagg 
agttctactg 
actactgcta 
agatttatcg 
ggatttggaa 
aagtcaatgc 
gaagatagtt 
actagcaaga 
aaaaaccaag 
ccattagatt 
aaggaagttg 
ggagcccaga 
gaaaaagacc 
tctttgccac 
gtaaataaga 
aagcaggcaa 
atattcagaa 
actgatccaa 
gtttgctcac 
gccaccacca 
aagaaaacaa 
ataccgaaag 
tttgaagcac 
agaagctgtt 
acaattctga 
gatcttgatt 
gaagctgatt 
aaagtttcag 
aaagtggaat 
aatgccagca 
atttctagag 
tctgatgttg 
aatgaaaatt 
ccttcaagaa 



gcttctgaaa 
ctcgggtgtc 
gaccggcgcg 
cttatttacc 
ggccaacatt 
gtcttaattg 
aagaatctga 
aaccatctta 
tgccgctgta 
atgttcccaa 
atgatgtttc 
gtacacatgt 
caaagtttgt 
tggatcctga 
tgctcatagt 
atgtgaaaag 
cttctgtgac 
aaacatcagg 
caaatgtcct 
tttcattatg 
ctaggaaaaa 
tgaaagaaaa 
caaatgtagc 
taccgtcttt 
tggagaaaat 
tattagacac 
gtatttctag 
gagatgaaga 
tatctggaac 
taagagaatc 
actttaaaaa 
agaaggagga 
cacagaattc 
ataagtttat 
accaaaaatc 
cacttacatt 
cacagaatga 
ggaaatgttc 
ataaagaagc 
ctctgtcatg 
atataaaaga 
acagtgatac 
ctcttatttt 
gcaaagaatc 
aattaaccaa 
ataaaaacgt 
aggtacaatt 



ctaggcggca gaggcggagc cgctgtggca ctgctgcgcc 60 
ttttgcggcg gtgggtcgcc gccgggagaa gcgtgagggg 120 
gtttttgtca gcttactccg gccaaaaaag aactgcacct 180 
aagcattgga ggaatatcgt aggtaaaaat gcctattgga 240 
ttttgaaatt tttaagacac gctgcaacaa agcagattta 300 
gtttgaagaa ctttcttcag aagctccacc ctataattct 360 
acataaaaac aacaattacg aaccaaacct atttaaaact 420 
taatcagctg gcttcaactc caataatatt caaagagcaa 480 
ccaatctcct gtaaaagaat tagataaatt caaattagac 540 
tagtagacat aaaagtcttc gcacagtgaa aactaaaatg 600 
ctgtccactt ctaaattctt gtcttagtga aagtcctgtt 660 
aacaccacaa agagataagt cagtggtatg tgggagtttg 720 
gaagggtcgt cagacaccaa aacatatttc tgaaagtcta 780 
tatgtcttgg tcaagttctt tagctacacc acccaccctt 840 
cagaaatgaa gaagcatctg aaactgtatt tcctcatgat 900 
ctatttttcc aatcatgatg aaagtctgaa gaaaaatgat 960 
agacagtgaa aacacaaatc aaagagaagc tgcaagtcat 1020 
gaattcattt aaagtaaata gctgcaaaga ccacattgga 1080 
agaagatgaa gtatatgaaa cagttgtaga tacctctgaa 1140 
tttttctaaa tgtagaacaa aaaatctaca aaaagtaaga 1200 
aattttccat gaagcaaacg ctgatgaatg tgaaaaatct 1260 
atactcattt gtatctgaag tggaaccaaa tgatactgat 1320 
acatcagaag ccctttgaga gtggaagtga caaaatctcc 1380 
ggcctgtgaa tggtctcaac taaccctttc aggtctaaat 1440 
acccctattg catatttctt catgtgacca aaatatttca 1500 
agagaacaaa agaaagaaag attttcttac ttcagagaat 1560 
cctaccaaaa tcagagaagc cattaaatga ggaaacagtg 1620 
gcagcatctt gaatctcata cagactgcat tcttgcagta 1680 
ttctccagtg gcttcttcat ttcagggtat caaaaagtct 1740 
acctaaagag actttcaatg caagtttttc aggtcatatg 1800 
agaaactgaa gcctctgaaa gtggactgga aatacatact 1860 
ctccttatgt ccaaatttaa ttgataatgg aagctggcca 1920 
tgtagctttg aagaatgcag gtttaatatc cactttgaaa 1980 
ttatgctata catgatgaaa cattttataa aggaaaaaaa 2040 
agaactaatt aactgttcag cccagtttga agcaaatgct 2100 
tgcaaatgct gattcaggtt tattgcattc ttctgtgaaa 2160 
ttctgaagaa ccaactttgt ccttaactag ctcttttggg 2220 
tagaaatgaa acatgttcta ataatacagt aatctctcag 2280 
aaaatgtaat aaggaaaaac tacagttatt tattacccca 2340 
cctgcaggaa ggacagtgtg aaaatgatcc aaaaagcaaa 2400 
agaggtcttg gctgcagcat gtcacccagt acaacattca 2460 
tgactttcaa tcccagaaaa gtcttttata tgatcatgaa 2520 
aactcctact tccaaggatg ttctgtcaaa cctagtcatg 2580 
atacaaaatg tcagacaagc tcaaaggtaa caattatgaa 2640 
aaatattccc atggaaaaga atcaagatgt atgtgcttta 2700 
tgagctgttg ccacctgaaa aatacatgag agtagcatca 2760 
caaccaaaac acaaatctaa gagtaatcca aaaaaatcaa 2820 
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gaagaaacta cttcaatttc aaaaataact gtcaatccag actctgaaga acttttctca 2880 
gacaatgaga ataattttgt cttccaagta gctaatgaaa ggaataatct tgctttagga 2940 
aatactaagg aacttcatga aacagacttg acttgtgtaa acgaacccat tttcaagaac 3000 
tctaccatgg ttttatatgg agacacaggt gataaacaag caacccaagt gtcaattaaa 3060 
aaagatttgg tttatgttct tgcagaggag aacaaaaata gtgtaaagca gcatataaaa 3120 
atgactctag gtcaagattt aaaatcggac atctccttga atatagataa aataccagaa 3180 
aaaaataatg attacatgaa caaatgggca ggactcttag gtccaatttc aaatcacagt 3240 
tttggaggta gcttcagaac agcttcaaat aaggaaatca agctctctga acataacatt 3300 
aagaagagca aaatgttctt caaagatatt gaagaacaat atcctactag tttagcttgt 3360 
gttgaaattg taaatacctt ggcattagat aatcaaaaga aactgagcaa gcctcagtca 3420 
attaatactg tatctgcaca tttacagagt agtgtagttg tttctgattg taaaaatagt 3480 
catataaccc ctcagatgtt attttccaag caggatttta attcaaacca taatttaaca 3540 
cctagccaaa aggcagaaat tacagaactt tctactatat tagaagaatc aggaagtcag 3600 
tttgaattta ctcagtttag aaaaccaagc tacatattgc agaagagtac atttgaagtg 3660 
cctgaaaacc agatgactat cttaaagacc acttctgagg aatgcagaga tgctgatctt 3720 
catgtcataa tgaatgcccc atcgattggt caggtagaca gcagcaagca atttgaaggt 3780 
acagttgaaa ttaaacggaa gtttgctggc ctgttgaaaa atgactgtaa caaaagtgct 3840 
tctggttatt taacagatga aaatgaagtg gggtttaggg gcttttattc tgctcatggc 3900 
acaaaactga atgtttctac tgaagctctg caaaaagctg tgaaactgtt tagtgatatt 3960 
gagaatatta gtgaggaaac ttctgcagag gtacatccaa taagtttatc ttcaagtaaa 4020 
tgtcatgatt ctgttgtttc aatgtttaag atagaaaatc ataatgataa aactgtaagt 4080 
gaaaaaaata ataaatgcca actgatatta caaaataata ttgaaatgac tactggcact 4140 
tttgttgaag aaattactga aaattacaag agaaatactg aaaatgaaga taacaaatat 4200 
actgctgcca gtagaaattc tcataactta gaatttgatg gcagtgattc aagtaaaaat 4260 
gatactgttt gtattcataa agatgaaacg gacttgctat ttactgatca gcacaacata 4320 
tgtcttaaat tatctggcca gtttatgaag gagggaaaca ctcagattaa agaagatttg 4380 
tcagatttaa cttttttgga agttgcgaaa gctcaagaag catgtcatgg taatacttca 4440 
aataaagaac agttaactgc tactaaaacg gagcaaaata taaaagattt tgagacttct 4500 
gatacatttt ttcagactgc aagtgggaaa aatattagtg tcgccaaaga gtcatttaat 4560 
aaaattgtaa atttctttga tcagaaacca gaagaattgc ataacttttc cttaaattct 4620 
gaattacatt ctgacataag aaagaacaaa atggacattc taagttatga ggaaacagac 4680 
atagttaaac acaaaatact gaaagaaagt gtcccagttg gtactggaaa tcaactagtg 4740 
accttccagg gacaacccga acgtgatgaa aagatcaaag aacctactct gttgggtttt 4800 
catacagcta gcgggaaaaa agttaaaatt gcaaaggaat ctttggacaa agtgaaaaac 4860 
ctttttgatg aaaaagagca aggtactagt gaaatcacca gttttagcca tcaatgggca 4920 
aagaccctaa agtacagaga ggcctgtaaa gaccttgaat tagcatgtga gaccattgag 4980 
atcacagctg ccccaaagtg taaagaaatg cagaattctc tcaataatga taaaaacctt 5040 
gtttctattg agactgtggt gccacctaag ctcttaagtg ataatttatg tagacaaact 5100 
gaaaatctca aaacatcaaa aagtatcttt ttgaaagtta aagtacatga aaatgtagaa 5160 
aaagaaacag caaaaagtcc tgcaacttgt tacacaaatc agtcccctta ttcagtcatt 5220 
gaaaattcag ccttagcttt ttacacaagt tgtagtagaa aaacttctgt gagtcagact 5280 
tcattacttg aagcaaaaaa atggcttaga gaaggaatat ttgatggtca accagaaaga 5340 
ataaatactg cagattatgt aggaaattat ttgtatgaaa ataattcaaa cagtactata 5400 
gctgaaaatg acaaaaatca tctctccgaa aaacaagata cttatttaag taacagtagc 5460 
atgtctaaca gctattccta ccattctgat gaggtatata atgattcagg atatctctca 5520 
aaaaataaac ttgattctgg tattgagcca gtattgaaga atgttgaaga tcaaaaaaac 5580 
actagttttt ccaaagtaat atccaatgta aaagatgcaa atgcataccc acaaactgta 5640 
aatgaagata tttgcgttga ggaacttgtg actagctctt caccctgcaa aaataaaaat 5700 
gcagccatta aattgtccat atctaatagt aataattttg aggtagggcc acctgcattt 5760 
aggatagcca gtggtaaaat cgtttgtgtt tcacatgaaa caattaaaaa agtgaaagac 5820 
atatttacag acagtttcag taaagtaatt aaggaaaaca acgagaataa atcaaaaatt 5880 
tgccaaacga aaattatggc aggttgttac gaggcattgg atgattcaga ggatattctt 5940 
cataactctc tagataatga tgaatgtagc acgcattcac ataaggtttt tgctgacatt 6000 
cagagtgaag aaattttaca acataaccaa aatatgtctg gattggagaa agtttctaaa 6060 
atatcacctt gtgatgttag tttggaaact tcagatatat gtaaatgtag tatagggaag 6120 
cttcataagt cagtctcatc tgcaaatact tgtgggattt ttagcacagc aagtggaaaa 6180 
tctgtccagg tatcagatgc ttcattacaa aacgcaagac aagtgttttc tgaaatagaa 6240 
gatagtacca agcaagtctt ttccaaagta ttgtttaaaa gtaacgaaca ttcagaccag 6300 
ctcacaagag aagaaaatac tgctatacgt actccagaac atttaatatc ccaaaaaggc 6360 
ttttcatata atgtggtaaa ttcatctgct ttctctggat ttagtacagc aagtggaaag 6420 
caagtttcca ttttagaaag ttccttacac aaagttaagg gagtgttaga ggaatttgat 6480 
ttaatcagaa ctgagcatag tcttcactat tcacctacgt ctagacaaaa tgtatcaaaa 6540 
atacttcctc gtgttgataa gagaaaccca gagcactgtg taaactcaga aatggaaaaa 6600 
acctgcagta aagaatttaa attatcaaat aacttaaatg ttgaaggtgg ttcttcagaa 6660 
aataatcact ctattaaagt ttctccatat ctctetcaat ttcaacaaga caaacaacag 6720 
ttggtattag gaaccaaagt ctcacttgtt gagaacattc atgttttggg aaaagaacag 6780 
gcttcaccta aaaacgtaaa aatggaaatt ggtaaaactg aaactttttc tgatgttcct 6840 
gtgaaaacaa atatagaagt ttgttctact tactccaaag attcagaaaa ctactttgaa 6900 
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acagaagcag tagaasittgc taaagctttt 
ctgccaagtc atgccacaca ttctcttttt 
tcaaattcaa gaattggaaa aagaagagga 
atcaaaagaa acttattaaa tgaatttgac 
aaggcttcaa aaagcactcc agatggcaca 
gtttctttag agccgattac ctgtgtaccc 
cagaatccaa attttaccgc acctggtcaa 
catctgactt tggaaaaatc ttcaagcaat 
gtttctgcta caagaaatga aaaaatgaga 
gtctttgttc caccttttaa aactaaatca 
aatattaact tggaggaaaa cagacaaaag 
agtaaaaata agattaatga caatgagatt 
gcagcagctg taactttcac aaagtgtgaa 
cagaatgcca gagatataca ggatatgcga 
ccacagccag gcagtctgta tcttgcaaaa 
gcagcagtag gaggccaagt tccctctgcg 
gtttctaaac attgcataaa aattaacagc 
gaagattatt ttggtaagga aagtttatgg 
ggatggctca taccctccaa tgatggaaag 
tgtgacactc caggtgtgga tccaaagctt 
agatggatca tatggaaact ggcagctatg 
agatgcctaa gcccagaaag ggtgcttctt 
gatagaagca gaagatcggc tataaaaaag 
acacttgttc tctgtgtttc tgacataatt 
agcaataaaa ctagtagtgc agatacccaa 
tggtatgctg ttaaggccca gttagatcct 
ctgacagttg gtcagaagat tattcttcat 
tgtacacctc ttgaagcccc agaatctctt 
cctgctcgct ggtataccaa acttggattc 
ttatcatcgc ttttcagtga tggaggaaat 
gcatacccta tacagtggat ggagaagaca 
agagaggaag aaaaggaagc agcaaaatat 
ttattcacta aaattcagga ggaatttgaa 
ttaccatcac gtgcactaac aagacagcaa 
tatgaagcag tgaagaatgc agcagaccca 
cagttaagag ccttgaataa tcacaggcaa 
cagttggaaa ttaggaaggc catggaatct 
gatgtcacaa ccgtgtggaa gttgcgtatt 
gttatactga gtatttggcg tccatcatca 
agatacagaa tttatcatct tgcaacttca 
atacagttag cagcgacaaa aaaaactcag 
ttatttcaga tttaccagcc acgggagccc 
tttcagccat cttgttctga ggtggaccta 
acaggacttg cccctttcgt ctatttgtca 
ttttggatag accttaatga ggacattatt 
ctccagtggc gaccagaatc caaatcaggc 
gtgttttctg ctagtccaaa agagggccac 
actgttgaga atattgacat actttgcaat 
catgcaaatg atcccaagtg gtccacccca 
gctcaaatca ttcctggtac aggaaacaag 
tattatcaaa gtcctttatc actttgtatg 
tcagcccaga tgacttcaaa gtcttgtaaa 
tgcaaaaaga gaagagcctt ggatttcttg 
cccatttgta catttgtttc tccggctgca 
ggcaccaaat acgaaacacc cataaagaaa 
tttaaaaaat tcaatgaaat ttctcttttg 
gcattgataa atacccaagc tcttttgtct 
gtcagtgaat ccactaggac tgctcccacc 
cgttgtacta catctctgat caaagaacag 
gagaaaaata agcaggacac aattacaact 
acaataaatt attgacgctt aacctttcca 
acacattagt acttatgttg cacaatgaga 
tttgtgtatc gggcaaaaat cgttttgccc 
tgcatatctt aaaactaaat gtaatttatt 
ctcggtggct catgcctgta atcccaacac 
aggccaggag ttcaagacca gcctgggcaa 
aaaaaaaagg ggaaaagaaa atcttttaaa 
-ttacaatcaa caaaatggtc atccaaactc 
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atggaagatg atgaactgac agattctaaa 6960 

acatgtcccg aaaatgagga aatggttttg 7020 

gagcccctta tcttagtggg agaaccctca 7080 

aggataatag aaaatcaaga aaaatcctta 7140 

ataaaagatc gaagattgtt tatgcatcat 7200 

tttcgcacaa ctaaggaacg tcaagagata 7260 

gaatttctgt ctaaatctca tttgtatgaa 7320 

ttagcagttt caggacatcc attttateaa 7380 

cacttgatta ctacaggcag accaaccaaa 7440 

cattttcaca gagttgaaca gtgtgttagg 7500 

caaaacattg atggacatgg ctctgatgat 7560 

catcagttta acaaaaacaa ctccaatcaa 7620 

gaagaacctt tagatttaat tacaagtctt 7680 

attaagaaga aacaaaggca acgcgtcttt 7740 

acatccactc tgcctcgaat ctctctgaaa 7800 

tgttctcata aacagctgta tacgtatggc 7860 

aaaaatgcag agtcttttca gtttcacact 7920 

actggaaaag gaatacagtt ggctgatggt 7980 

gctggaaaag aagaatttta tagggctctg 8040 

atttctagaa tttgggttta taatcactat 8100 

gaatgtgcct ttcctaagga atttgctaat 8160 

caactaaaat acagatatga tacggaaatt 8220 

ataatggaaa gggatgacac agctgcaaaa 8280 

tcattgagcg caaatatatc tgaaaqttct 8340 

aaagtggcca ttattgaact tacagatggg 8400 

cccctcttag ctgtcttaaa gaatggcaga 8460 

ggagcagaac tggtgggctc tcctgatgcc 8520 

atgttaaaga tttctgctaa cagtactcgg 8580 

tttcctgacc ctagaccttt tcctctgccc 8640 

gttggttgtg ttgatgtaat tattcaaaga 8700 

tcatctggat tatacatatt tcgcaatgaa 8760 

gtggaggccc aacaaaagag actagaagcc 8820 

gaacatgaag aaaacacaac aaaaccatat 8880 . 

gttcgtgctt tgcaagatgg tgcagagctt 8940 

gcttaccttg agggttattt cagtgaagag 9000 

atgttgaatg ataagaaaca agctcagatc 9060 

gctgaacaaa aggaacaagg tttatcaagg 9120 

gtaagctatt caaaaaaaga aaaagattca 9180 

gatttatatt ctctgttaac agaaggaaag 9240 

aaatctaaaa gtaaatctga aagagctaac 9300 

tatcaacaac taccggtttc agatgaaatt 9360 

cttcacttca gcaaattttt agatccagac 9420 

ataggatttg tcgtttctgt tgtgaaaaaa 9480 

gacgaatgtt acaatttact ggcaataaag 9540 

aagcctcata tgttaattgc tgcaagcaac 9600 

cttcttactt tatttgctgg agatttttct 9660 

tttcaagaga cattcaacaa aatgaaaaat 9720 

gaagcagaaa acaagcttat gcatatactg 9780 

actaaagact gtacttcagg gccgtacact 9840 

cttctgatgt cttctcctaa ttgtgagata 9900 

gccaaaagga agtctgtttc cacacctgtc 9960 

ggggagaaag agattgatga ccaaaagaac 10020 

agtagactgc ctttacctcc acctgttagt 10080 

cagaaggcat ttcagccacc aaggagttgt 10140 

aaagaactga antctcctca gatgactcca 10200 

gaaagtaatt caatagctga cgaagaactt 10260 

ggttcaacag gagaaaaaca atttatatct 10320 

agttcagaag attatctcag actgaaacga 10380 

gagagttccc aggccagtac ggaagaatgt 10440 

aaaaaatata tctaagcatt tgcaaaggcg 10500 

gtttataaga ctggaatata atttcaaacc 10560 

aaagaaatta gtttcaaatt tacctcagcg 10620 

gattccgtat tggtatactt ttgcttcagt 10680 

aactaatcaa gaaaaacatc tttggctgag 10740 

tttgagaagc tgaggtggga ggagtgcttg 10800 

catagggaga cccccatctt tacgaagaaa 10860 

tctttggatt tgatcactac aagtattatt 10920 

aaacttgaga aaatatcttg ctttcaaatt 10980 
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10987 



<210> 291 

<211> 1552 

<212> DNA 

<213> Homo sapiens 



<400> 291 

gcccgtacac 

agcccctggc 

ctgtcactgc 

tcccccttgg 

agtgaagagg 

gaggatctac 

tccctgaagt 

aataatgccc 

ccgccctggc 

cgcccccagc 

ctcccgccgc 

cctcctgggc 

ctgcactggg 

cctgccgaga 

gggcgcccgg 

agtgcctatg 

caggtcccag 

tatgaggggt 

cagacagtga 

ggtgactctc 

gaggcctcct 

aattcctgcc 

accagcgtcg 

gtgagctacc 

tgtgagaagc 

atgccacttc 



accgtgtgct 
tccctctgtt 
tgcttctgat 
gaggaggctc 
attcacccag 
ctggagagga 
tagaggatct 
acagggacaa 
cccgggtgtc 
tcgccgcctt 
tcccagaact 
tagagatggc 
gggctgcagg 
tccacgtggt 
gaggcctggc 
agcagttgct 
gactggacat 
ctctgactac 
tgctgagtgc 
ggctacagct 
tccctgctgg 
tggctgctgg 
cgttccttgt 
gcccagcaga 
cagccagagg 
cttttaactg 



gggacaccec 
gatcccggcc 
gcctgtccat 
ttctggggaa 
agaggaggat 
ggatctacct 
acctactgtt 
agaaggggat 
cccagcctgc 
ctgcccggcc 
gcgcctgcgc 
tctgggtccc 
tcgtccgggc 
tcacctcagc 
cgtgttggcc 
gtctcgcttg 
atctgcactc 
accgccctgt 
taagcagctc 
gaacttccga 
agtggacagc 
tgacatccta 
gcagatgaga 
ggtagccgag 
catctgaggg 
ccaagaaatt 



acagtcagcc 
cctgctccag 
ccccagaggt 
gatgacccac 
ccacccggag 
gaagttaagc 
gaggctcctg 
gaccagagtc 
gcgggccgct 
ctgcgccccc 
aacaatggcc 
gggcgggagt 
tcggagcaca 
accgcctttg 
gcctttctgg 
gaagaaatcg 
ctgccctctg 
gcccagggtg 
cacaccctct 
gcgacgcagc 
agtcctcggg 
gccctggttt 
aggcagcaca 
actggagcct 
ggagccggta 
ttttaaaata 



gcatggctcc 
gcctcactgt 
tgccccggat 
tgggcgagga 
aggaggatct 
ctaaatcaga 
gagatcctca 
attggcgcta 
tccagtcccc 
tggaactcct 
acagtgtgca 
accgggctct 
ctgtggaagg 
ccagagttga 
aggagggccc 
ctgaggaagg 
acttcagccg 
tcatctggac 
ctgacaccct 
ctttgaatgg 
ctgctgagcc 
ttggcctcct 
gaaggggaac 
agaggctgga 
actgtcctgt 
aatatttata 



cctgtgcccc 
gcaactgctg 
gcaggaggat 
ggatctgccc 
acctggagag 
agaagagggc 
agaaccccag 
tggaggcgac 
ggtggatatc 
gggcttccag 
actgaccctg 
gcagctgcat 
ccaccgtttc 
cgaggccttg 
ggaagaaaac 
ctcagagact 
ctacttccaa 
tgtgtttaac 
gtggggacct 
gcgagtgatt 
agtccagctg 
ttttgctgtc 
caaagggggt 
tcttggagaa 
cctgctcatt 
at 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1552 



<210> 292 

<211> 1578 

<212> DNA 

<213> Homo sapiens 



<400> 292 

acgaacaggc 

tctcggcgtg 

cgggcctccg 

gcgctccgag 

gcaggcgcaa 

agaacagctc 

atgaagaagg 

cctcttgaaa 

gaacctgagc 

gttgatactg 

ctgtgtcagg 

gctgatccaa 

gaggaagagc 

agagccatcc 

accatgtaca 

aagatgctgc 

taccctccag 

atcagacaga 

cctttgcact 

ttggccaaat 

tctcaaattg 

acaccaactc 

cacctggcta 

aacaagtatg 

ctagttcaag 



caataaggag 
ctgcggcgga 
gtgttctgct 
tcaccaggaa 
agcgcgttcc 
ttggggacat 
aagcaaaacc 
aggtacctat 
cagaacctga 
cctctccaag 
ctttctctga 
acctttgtag 
aagcagtcag 
taattgactg 
tgactgtctc 
agctggttgg 
aaattggtga 
tggaaatgaa 
tccttcggag 
acctgatgga 
cagcaggagc 
tacaacatta 
agaatgtagt 
ccacatcgaa 
atttagccaa 



ggagcagtgc 
acggctgttg 
tctccccgct 
ctcgaaaatt 
tacggcccct 
tggtaacaaa 
ttcagctact 
gctggtgcca 
gcctgttaaa 
cccaatggaa 
tgtaattctt 
tgaatatgtg 
accaaaatac 
gctagtacag 
cattattgat 
tgtcactgcc 
ctttgctttt 
gattctaaga 
agcatctaag 
actaactatg 
tttttgctta 
cctgtcatat 
catggtaaat 
gcatgctaag 
ggctgtggca 



ggggtttaaa tctgaggcta 
gtttctgctg gttgtaggtc 
gagctgctgc ctggtgaaga 
aatgctgaaa ataaggcgaa 
gctgcaacct ccaagcccgg 
gtcagtgaac aactgcaggc 
ggaaaagtca ttgataaaaa 
gtgccagtgt ctgagccagt 
gaagaaaaac tttcgcctga 
acatctggat gtgcccctgc 
gcagtaaatg atgtggatgc 
aaagatattt atgcttatct 
ctactgggtc gggaagtcac 
gttcaaatga aattcaggtt 
cggttcatgc agaataattg 
atgtttattg caagcaaata 
gtgactgaca acacttatac 
gctttaaact ttggtctggg 
attggagagg ttgatgtcga 
ttggactatg acatggtgca 
gcactgaaaa ttctggataa 
actgaagaat ctcttcttcc 
caaggactta caaagcacat 
atcagcactc taccacagct 
aaggtgtaac ttgtaaactt 
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ggctggctct 
cttggctggt 
ggaagccatg 
gatcaacatg 
actgaggcca 
caaaatgcct 
actaccaaaa 
gccagagcca 
gcctattttg 
agaagaagac 
agaagatgga 
gagacaactt 
tggaaacatg 
gttgcaggag 
tgtgcccaag 
tgaagaaatg 
taagcaccaa 
tcggcctcta 
gcaacatact 
ctttcctcct 
tggtgaatgg 
agttatgcag 
gactgtcaag 
gaattctgca 
gagttggagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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actatattta caaataaaat tggcaccatg tgccatctgt aaaaaaaaaa aaaaaaaaaa 1560 
aaaaaaaaaa aaaaaaaa .1578 

<210> 293 

<211> 3195 

<212> DNA 

<213> Homo sapiens 

<400> 293 

agaggcttcc ctggctggtg cctgagcccg gcgtccctcg ccccccgccc tccccgcatc 60 
cctctcctcc ctcgcgcctg gccctgtggc tcttcctccc tccctccttc cccccccccc 120 
cacccctcgc ccgctgcctc cctcggccca gccagctgtg ccggcgtttg ttggctgccc 1*80 
tgcgcccggc cctccagcca gccttctgcc ggccccgccg cgatggaggt gccccagccg 240 
gagcccgcgc caggctcggc tctcagtcca gcaggcgtgt gcggtggcgc ccagcgtccg 300 
ggccacctcc cgggcctcct gctgggatct catggcctcc tggggtcccc ggtgcgggcg 360 
gccgcttcct cgccggtcac caccctcacc cagaccatgc acgacctcgc cgggctcggc 420 
agccgcagcc gcctgacgca cctatccctg tctcgacggg catccgaatc ctccctgtcg 480 
tctgaatcct ccgaatcttc tgatgcaggt ctctgcatgg attcccccag ccctatggac 540 
ccccacatgg cggagcagac gtttgaacag gccatccagg cagccagccg gatcattcga 600 
aacgagcagt ttgccatcag acgcttccag tctatgccgg tgaggctgct gggccacagc 660 
cccgtgcttc ggaacatcac caactcccag gcgcccgacg gccggaggaa gagcgaggcg 720 
ggcagtggag ctgccagcag ctctggggaa gacaaggaga atgtgcgctt ctggaaggcc 780 
ggggtgggag ctctccggga agaggagggg gcatgctggg gtggttccct ggcatgtgag 840 
gaccctcctc tcccatcttg gctgcaggat ggatttgtct tcaagatgcc atggaagccc 900 
acacatccca gctccaccca tgctctggca gagtgggcca gccgcaggga agcctttgcc 960 
cagagaccca gctcggcccc cgacctgatg tgtctcagtc ctgaccggaa gatggaagtg 1020 
gaggagctca gccccctggc cctaggtcgc ttctctctga cccctgcaga gggggatact 1080 
gaggaagatg atggatttgt ggacatccta gagagtgact taaaggatga tgatgcagtt 1140 
cccccaggca tggagagtct cattagtgcc ccactggtca agaccttgga aaaggaagag 1200 
gaaaaggacc tcgtcatgta cagcaagtgc cagcggctct tccgctctcc gtccatgccc 1260 
tgcagcgtga tccggcccat cctcaagagg ctggagcggc cccaggacag ggacacgccc 1320 
gtgcagaata agcggaggcg gagcgtgacc cctcctgagg agcagcagga ggctgaggaa 1380 
cctaaagccc gcgtcctccg ctcaaaatca ctgtgtcacg atgagatcga gaacctcctg 1440 
gacagtgacc accgagagct gattggagat tactctaagg ccttcctcct acagacagta 1500 
gacggaaagc accaagacct caagtacatc tcaccagaaa cgatggtggc cctattgacg 1560 
ggcaagttca gcaacatcgt ggataagttt gtgattgtag actgcagata cccctatgaa 1620 
tatgaaggcg ggcacatcaa gactgcggtg aacttgcccc tggaacgcga cgccgagagc 1680 
ttcctactga agagccccat cgcgccctgt agcctggaca agagagtcat cctcattttc 1740 
cactgtgaat tctcatctga gcgtgggccc cgcatgtgcc gtttcatcag ggaacgagac 1800 
cgtgctgtca acgactaccc cagcctctac taccctgaga tgtatatcct gaaaggcggc 1860 . 
tacaaggagt tcttccctca gcacccgaac ttctgtgaac cccaggacta ccggcccatg 1920 
aaccacgagg ccttcaagga tgagctaaag accttccgcc tcaagactcg cagctgggct 1980 
ggggaigcgga gccggcggga gctctgtagc cggctgcagg accagtgagg ggcctgcgcc 2040 
agtcctgcta cctcccttgc ctttcgaggc ctgaagccag ctgccctatg ggcctgccgg 2100 
gctgagggcc tgctggaggc ctcaggtgct gtccatggga aagatggtgt ggtgtcctgc 2160 
ctgtctgccc cagcccagat tcccctgtgt catcccatca ttttccatat cctggtgccc 2220 
cccacccctg gaagagccca gtctgttgag ttagttaagt tgggttaata ccagcttaaa 2280 
ggcagtattt tgtgtcctcc aggagcttct tgtttccttg ttagggttaa cccttcatct 2340 
tcctgtgtcc tgaaacgctc ctttgtgtgt gtgtcagctg aggctgggga gagecgtggt 2400 
ccctgaggat gggtcagagc taaactcctt cctggcctga gagtcagctc tctgccctgt 2460 
gtacttcccg ggccagggct gcccctaatc tctgtaggaa ccgtggtatg tctgccatgt 2520 
tgcccctttc tcttttcccc tttcctgtcc caccatacga gcacctccag cctgaacaga 2580 
agctcttact ctttcctatt tcagtgttac ctgtgtgctt ggtctgtttg actttacgcc 2640 
catctcagga cacttccgta gactgtttag gttcccctgt caaatatcag ttacccactc 2700 
ggtcccagtt ttgttgcccc agaaagggat gttattatcc ttgggggctc ccagggcaag 2760 
ggttaaggcc tgaatcatga gcctgctgga agcccagccc ctactgctgt gaaccctggg 2820 
gcctgactgc tcagaacttg ctgctgtctt gttgcggatg gatggaaggt tggatggatg 2880 
ggtggatggc cgtggatggc cgtggatgcg cagtgccttg catacccaaa ccaggtggga 2940 
gcgttttgtt gagcatgaca cctgcagcag gaatatatgt gtgcctattt gtgtggacaa 3000 
aaatatttac acttagggtt tggagctatt caagaggaaa tgtcacagaa gcagctaaac 3060 
caaggactga gcaccctctg gattctgaat ctcaagatgg gggcagggct gtgcttgaag 3120 
gccctgctga gtcatctgtt agggccttgg ttcaa:taaag cactgagcaa gttgagaaaa 3180 
aaaaaaaaaa aaaaa 3195 

<210> 294 

<211> 3737 

<212> DNA 

<213> Homo sapiens 
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ggcgtccgcg cacacctccc 
ccgcaaccgc tgagccatcc 
cggtgctgct gctgctgctg 
cgggtcgggg ccgtgccgcg 
atgactgcca tgccgacgcc 
agcctggcta ccaaggggaa 
tcaatggagg ctgtgtccat 
ttgatggctt catgttggct 
agaacaatgg cggctgccag 
gcaaggaggg gtttttcctg 
gcctgagctg catgaataag 
gcagcgtcgc ctgtgagtgc 
tcttgacctg taaccatggg 
gcccagagtg cagctgccat 
agcgagagga cactgtcctg 
ataaacgggt gaaacggcgg 
accgcacctg taaggatact 
tccagttgga tgggaagaca 
gtgatcattt ctgcaaaaac 
aattattaac agatgagaag 
gtgaccacag ctgcatcaac 
ccctgtatgg cttcacccac 
gtcagcaggt ctgtgtgaac 
agctccactg gaataaaaaa 
caccccgtgt g-tccctgcac 
gtcactctgg cattcacctc 
agctaaatga aggcaagtgt 
cagcactacc agagaagcac 
gcagctctgg caagcaagtc 
ttatcactgt tgagtttgag 
tgagctgcat cgtaaagcga 
aggccgtcca cagggagcag 
aaaagcctcc cagaacatct 
cagaaaacca atgtgtcagt 
gcattttatg tccaaatgga 
gcccaagacc aggaaattct 
gtggaggtct gtgtcaacct 
gtgccctggg cacgttccag 
gccttgccac caaacatcag 
gttcacctgg acatttctac 
cataccagcc tgaatttgga 
actttgatgg ctccacaaac 
gagatttcac tgggtacatt 
agtgtacgtg gaccatcaac 
tcttcctgcc catagaggac 
ccaattctgt gacaacatat 
ccaggtcaaa gaagctgtgg 
tccaggtccc atacgtgaca 
gagatggcag gctctatgca 
tcaaggctct gtttgatgtc 
agtcccgaga gatgtttcca 
ttttgagacc ttacaaatga 
ggttggtggg acagagctgt 
ccgtatcagt gactcattag 
gaacttggtt tttctttccc 
cagcttctca ctgctgtggg 
ttggtcagcc taggtgagac 
tgtagtggaa aggaggccac 
ccggccctct ctaagggagc 
caagagggga gggaaggaga 
actcagtttc tccacagcct 
agttctaagc agtgctcgtg 
agcacttctg gagacat 

<210> 295 
<211> 2042 
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cgcgccgccg ccgccaccgc ccgcactccg ccgcctctgc 60 
atgggggtcg cgggccgcaa ccgtcccggg gcggcctggg 120 
ctgctgccgc cactgctgct gctggcgggg gccgtcccgc 180 
gggccgcagg aggatgtaga tgagtgtgcc caagggctag 240 
ctgtgtcaga acacacccac ctcctacaag tgctcctgca 300 
ggcaggcagt gtgaggacat cgatgaatgt ggaaatgagc 360 
gactgtttga atattccagg caattatcgt tgcacttgtt 420 
catgacggtc ataattgtct tgatgtggac gagtgcctgg 480 
catacctgtg tcaacgtcat ggggagctat gagtgctgct 540 
agtgacaatc agcacacctg cattcaccgc tcggaagagg 600 
gatcacggct gtagtcacat ctgcaaggag gccccaaggg 660 
aggcctggtt ttgagctggc caagaaccag agagactgca 720 
aacggtgggt gccagcactc ctgtgacgat acagccgatg 780 
ccacagtaca agatgcacac agatgggagg agctgccttg 840 
gaggtgacag agagcaacac cacatcagtg gtggatgggg 900 
ctgctcatgg aaacgtgtgc tgtcaacaat ggaggctgtg 960 
tcgacaggtg tccactgcag ttgtcctgtt ggattcactc 1020 
tgtaaagata ttgatgagtg ccagacccgc aatggaggtt 1080 
atcgtgggca gttttgactg cggctgcaag aaaggattta 1140 
tcttgccaag atgtggatga gtgctctttg gataggacct 1200 
caccctggca catttgcttg tgcttgcaac cgagggtaca 1260 
tgtggagaca ccaatgagtg cagcatcaac aacggagg|ct 1320 
acagtgggca gctatgaatg ccagtgccac cctgggtaca 1380 
gactgtgtgg aagtgaaggg gctcctgccc acaagtgtgt 1440 
tgcggtaaga gtggtggagg agacgggtgc ttcctcagat 1500 
tcttcagatg tcaccaccat caggacaagt gtaaccttta 1560 
agtttgaaaa atgctgagct gtttcccgag ggtctgcgac 1620 
agctcagtaa aagagagctt ccgctacgta aaccttacat 1680 
ccaggagccc ctggccgacc aagcacccct aaggaaatgt 1740 
cttgaaacta accaaaagga ggtgacagct tcttgtgacc 1800 
accgagaagc ggctccgtaa agccatccgc acgctcagaa 1860 
tttcacctcc agctctcagg catgaacctc gacgtggcta 1920 
gaacgccagg cagagtcctg tggagtgggc cagggtcatg 1980 
tgcagggctg ggacctatta tgatggagca cgagaacgct 2040 
accttccaaa atgaggaagg acaaatgact tgtgaaccat 2100 
ggggccctga agaccccaga agcttggaat atgtctgaat 2160 
ggtgaatatt ctgcagatgg ctttgcacct tgccagctct 2220 
cctgaagctg gtcgaacttc ctgcttcccc tgtggaggag 2280 
ggagctactt cctttcagga ctgtgaaacc agagttcaat 2340 
aacaccacca ctcaccgatg tattcgttgc ccagtgggaa 2400 
aaaaataatt gtgtttcttg cccaggaaat actacgactg 2460 
ataacccagt gtaaaaacag aagatgtgga ggggagctgg 2520 
gaatccccaa actacccagg caattaccca gccaacaccg 2580 
ccacccccca agcgccgcat cctgatcgtg gtccctgaga 2640 
gactgtgggg actatctggt gatgcggaaa acctcttcat 2700 
gaaacctgcc agacctacga acgccccatc gccttcacct 2760 
attcagttca agtccaatga agggaacagc gctagagggt 2820 
tatgatgagg actaccagga actcattgaa gacatagttc 2880 
tctgagaacc atcaggaaat acttaaggat aagaaactta 2940 
ctggcccatc cccagaacta tttcaagtac acagcccagg 3000 
agatcgttca tccgattgct acgttccaaa gtgtccaggt 3060 
ctcagcccac gtgccactca atacaaatgt tctgctatag 3120 
cttccttctg catgtcagca cagtcgggta ttgctgcctc 3180 
agttcaattt ttatagataa tacagatatt ttggtaaatt 3240 
agcatcgtgg atgtagactg agaatggctt tgagtggcat 3300 
cggatgtctt ggatagatca cgggctggct gagctggact 3360 
tcacctgtcc ttctggggtc ttactcctcc tcaaggagtc 3420 
agaataagct. gcttattctg aaacttcagc ttcctctagc 3480 
cctctgcact cgtgtgcagg ctctgaccag gcagaacagg 3540 
cccctgcagg ctccctccac ccaccttgag acctgggagg 3600 
tctccagcct gtgtgataca agtttgatcc caggaacttg 3660 
aaaaaaaaaa gcagaaagaa ttagaaataa ataaaaacta 3720 

3737 
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<400> 295 

ggggccagtc 

cctctggggc 

gatttacacg 

actgggactt 

atagagtaac 

gtccagaaaa 

taiaaattcta 

gtggaggaga 

agagattctt 

acagggatat 

actttggctt 

gtggtacttt 

cagttgatgt 

gggaccaacc 

tcaacccttg 

agaatccatc 

ccctcaagaa 

gtggattttc 

gtgaagaaaa 

gggataccag 

catgtcctga 

acccctggca 

aatcttatca 

gtatgaatca 

tgaatttgtt 

gattggagtt 

gccagaaggt 

atatagtgct 

gcaaatagta 

ttgtttgttc 

ggatgaatag 

catctggtgg 

tcatttttat 

aattctaaga 

aa 



gttcgccgga 
ctgaggaggg 
acaaaatgcc 
ggtgcaaacc 
tgaagaagca 
tattaagaaa 
tggtcacagg 
gctttttgac 
ccatcaactc 
taaaccagaa 
ggcaacagta 
accatatgtt 
ttggtcctgt 
cagtgacagc 
gaaaaaaatc 
agcaagaatt 
aggggcaaaa 
taagcacatt 
tgtgaagtac 
cccctcatac 
tcatatgctt 
gcggttggtc 
atgcctgaaa 
ggttactata 
agaaatggat 
caagagacac 
ttggcttcct 
gctatgttga 
gttcctgaag 
ggcatacaaa 
aattcatttg 
aaaccaagtt 
caaaacattt 
tgaaccaata 



aagcatttgt 
cagaatttca 
gaggtgctcg 
ctgggagaag 
gtcgcagtga 
gagatctgta 
agagaaggca 
agaatagagc 
atggcagggg 
aatcttctgt 
tttcggtata 
gctccagaac 
ggaatagtac 
tgtcaggagt 
gattctgctc 
accattccag 
aggccccgag 
caatccaatt 
tccagttctc 
attgataaat 
ttgaatagtc 
aaaagaatga 
gagacttgtg 
tcaacaactg 
gataaaatat 
ttcctgaaga 
gccacatgat 
cattattctt 
tgttcacttc 
taatacctat 
attatttctt 
tcaggggaca 
tgtttaattc 
aagacataat 



ctcccacctc 
accttcggtg 
gtggagtcat 
gtgcctatgg 
agattgtaga 
tcaataaaat 
atatccaata 
cagacatagg 
tggtttatct 
tggatgaaag 
ataatcgtga 
ttctgaagag 
ttactgcaat 
attctgactg 
ctctagctct 
acatcaaaaa 
tcacttcagg 
tggacttctc 
agccagaacc 
tggtacaagg 
agttacttgg 
cacgattctt 
agaagttggg 
ataggagaaa 
tggttgactt 
ttaaagggaa 
cggaccatcg 
cctagagaag 
cctgtttatc 
atcttaattg 
catgtgtgtt 
tgagttttcc 
aaaaagtaca 
tcttgcaaaa 



atcataacaa 
tgcttgggag 
ggcagtgccc 
agaagttcaa 
tatgaagcgt 
gctaaatcat 
tttatttctg 
catgcctgaa 
gcatggtatt 
ggataacctc 
gcgtttgttg 
aagagaattt 
gctcgctgga 
gaaagaaaaa 
gctgcataaa 
agatagatgg 
tggtgtgtca 
tccagtaaac 
ccgcacaggt 
gatcagcttt 
caccccagga 
taccaaattg 
ctatcaatgg 
caataaactc 
ccggctttct 
gctgattgat 
gctctgggga 
attatcctgt 
caaacatctt 
taagcaaaac 
tagtatctga 
agcttttata 
tatttcttcc 
aaaaaaaaaa 



caattaattt 
tggcgattgt 
tttgtggaag 
cttgctgtga 
gccgtagact 
gaaaatgtag 
gagtactgta 
ccagatgctc 
ggaataactc 
aaaatctcag 
aacaagatgt 
catgcagaac 
gaattgccat 
aaaacatacc 
atcttagttg 
tacaacaaac 
gagtctccca 
agtgcttcta 
ctttccttat 
tcccagccca 
tcctcacaga 
gatgcagaca 
aagaaaagtt 
attttcaaag 
aagggtgatg 
attgtgagca 
atcctggtga 
cctgcaaact 
ccaatttatt 
tttggggaaa 
atttgaaact 
cacacgtatc 
atgttgattt 
aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 . 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2042 



<210> 296 

.<211> 2547 

<212> DNA 

<213> Homo sapiens 



<400> 296 

cttacaaggt 

agggtcgaat 

gOO^OQcicaa 

aaggaacggt 

gctcggcact 

ttccaacagc 

cttctgggaa 

cgccacatct 

cgtctaacca 

gggaacccgg 

ggcggcgcct 

ttgattggct 

tgcggagagt 

ttgaggctca 

agtcccaagg 

caaactccag 

ccactcagga 

agcctacccc 

aatgtgtgaa 

aaccactgct 

tcagggaagt 

gaacctttgt 



acagtcctct 
gcttctctga 
gacgctgcag 
tgcagtccat 
gtgggcagaa 
ggggataatt 
cccctgtcgg 
ggccggcggc 
gactaatgtt 
gttctaagtt 
gacggccaca 
gagggtggag 
gtgcggctcc 
gcagtctcat 
ctcctcctca 
ccagtcctct 
actctattct 
tgccccctgg 
tgacaactac 
gaaaagaaca 
gggtcctaaa 
aaatacagag 



gctcaggggg 
gaagtccttt 
ttgctggtac 
aaggctgaag 
gaacctgtcc 
gcttcccgcc 
ccaaaacccc 
ctatctagcc 
gctgattggc 
ccgctctccc 
tctgctgctc 
tttgtatctg 
aggtgggctc 
ggcagcagtg 
cagtcccagg 
cactccagct 
attcctgagg 
gctcgattat 
tggtttggga 
gataaatacc 
aactcttaca 
cttgtaggga 



gccaggaggg tcttataggc 
tcagtctgag acctctggct 
ctgtgctgga gcttcgctgt 
tagtctcgag tggggtcagg 
tcccgcccgg ggccccatgg 
tgcgacgcag catcgcagct 
cacacccgga gcaaagcccc 
gtggtcactc gtggggaaaa 
tggggagtcg agggggcggg 
ttctaaacta caactcccag 
ctcattggtc cggcggcagg 
caggtttagc gccactctgc 
acgcggtcgt gatgtctcgg 
cctgttcaca gccccatggc 
gcatatccag ctcctctacc 
ctgggacact gagctcctta 
accaagaacc tgaggaccaa 
gggcccttca ggatggattt 
gggacaaaag ctgtgaatat 
gaacatacag caagaaacac 
ttgcatacat agaagatcac 
aaggaaaacg ccgtcctttg 
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atcattcacc 
gaagaaatct 
atcaactctg 
tgcctgcagc 
gccgccgcag 
tagcggtctc 
ggctctcccc 
gcaaagagag 
atcacccgag 
gaggcattga 
ggagggggtt 
tggctgaggc 
gagtcggatg 
agcgttaccc 
agcacgatgc 
gagacagtgt 
gaacctgagg 
gccaatcttg 
tgctttgatg 
tttcggattt 
agtggcaatg 
aataacaatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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ctgaaattgc actgtcacta agcagaaata 
atgatcagtc agtttatcct aaggcattaa 
gaagtggtgc ctgtggagag gtaaagctgg 
ccataaagat catcagcaaa aggaagtttg 
ctctcaatgt tgaaacagaa atagaaattt 
agattaaaaa cttttttgat gcagaagatt 
gagagctgtt tgacaaagtg gtggggaata 
atttttacca gatgctcttg gctgtgcagt 
acttaaagcc agagaatgtt ttactgtcat 
ctgattttgg gcactccaag attttgggag 
cccccaccta " cttggcgcct gaagttcttg 
ctgtggactg ctggagttta ggagttattc 
tctctgagca taggactcaa gtgtcactga 
tcattcctga agtctgggca gaagtctcag 
tggtagtgga tccaaaggca cgttttacga 
aggatgaaga catgaagaga aagtttcaag 
ctctacccca ggttctagcc cagccttcta 
ccgagggtgc cgagaccaca aagcgcccag 
ggtttgaaca cgaaagaaat gtaccttctt 
tgttttttta tagtttgtat tttaattatg 
tacaattaaa aacctgatgg aacctgg 

<210> 297 

<211> 2768 

<212> DNA 

<213> Homo sapiens 

<400> 297 

cactgctgtg cagggcagga aagctccatg 
tgaaaggaag actcagagga gagagataag 
acttggccat ggaaacctgg cttctcctgg 
gaacccattc aeatggactt tttaagaagc 
ttttgggaaa tattttgtcc taccataagg 
aaaagtatgg aaaagtgtgg ggcttttatg 
atcctgacat gatcaaaaca gtgctagtga 
ggccttttgg tccagtggga tttatgaaaa 
ggaagagatt acgatcattg ctgtctccaa 
tccctatcat tgcccagtat ggagatgtgt 
caggcaagcc tgtcaccttg aaagacgtct 
gcacatcatt tggagtgaac atcgactctc 
acaccaagaa gcttttaaga tttgattttt 
ttccattcct catcccaatt cttgaagtat 
caaatttttt aagaaaatct gtaaaaagga 
agcaccgagt ggatttcctt cagctgatga 
cccacaaagc tctgtccgat ctggagctcg 
gctatgaaac cacgagcagt gttctctcct 
atgtccagca gaaactgcag gaggaaattg 
cctatgatac tgtgctacag atggagtatc 
tattcccaat tgctatgaga cttgagaggg 
tgttcattcc caaaggggtg gtggtgatga 
agtactggac agagcctgag aagttcctcc 
acatagatcc ttacatatac acaccctttg 
ggtttgctct catgaacatg aaacttgctc 
aaccttgtaa agaaacacag atccccctga 
aaaaacccgt tgttctaaag gttgagtcaa 
tcctaaggac ttctgctttg ctcttcaaga 
aattactttg tgaatagaac tctgaaatga 
taaccgggga ttctgtacat gcattgagct 
gggaatataa aggaggtgac caaatcagtg 
cacgggacta tttccaccac ccccagttag 
agaatcaaca tttctcaata atttcctcca 
gatggctcta acaatgacat ttatatcaca 
tgttaaatca ataaagacca ctttacaaaa 
gagaaatcta tagaactgaa tgagaaccaa 
tgttggcgtg gggcctttgt cagaactaga 
ccactgtgac tttgcccatt gtttagaaag 
atcaggcaca gtggctcacg cctgtaatcc 
cgcctgaggt caggagttca agacaagcct 



l9740-000lPCT.txt 

aagtttttgt cttttttgat ctgactgtag 1380 

gagatgaata catcatgtca aaaactcttg 1440 

ctttcgagag gaaaacatgt aagaaagtag 1500 

ctattggttc agcaagagag gcagacccag 1560 

tgaaaaagct aaatcatcct tgcatcatca 1620 

attatattgt tttggaattg atggaagggg 1680 

aacgcctgaa agaagctacc tgcaagctct 1740 

acctteatga aaacggtatt atacaccgtg 1800 

ctcaagaaga ggactgtctt ataaagatta 1860 

agacctctct catgagaacc ttatgtggaa 1920 

tttctgttgg gactgctggg tataaccgtg 1980 

tttttatctg ccttagtggg tatccacctt 2040 

aggatcagat caccagtgga aaatacaact 2100 

agaaagctct ggaccttgtc aagaagttgt 2160 

cagaagaagc cttaagacac ccgtggcttc 2220 

atcttctgtc tgaggaaaat gaatccacag 2280 

ctagtcgaaa gcggccccgt gaaggggaag 2340 

ctgtgtgtgc tgctgtgttg tgaactccgt 2400 

tcactctgtc atctttcttt tctttgagtc 2460 

ggaataattg ctttttcaca gtcactgatg 2520 

2547 



cacatagccc agcaaagagc aacacagagc 60 
taaggaaagt agtgatggct ctcatcccag 120 
ctgtcagcct ggtgctcctc tatctatatg 180 
ttggaattcc agggcccaca cctctgcctt 240 
gcttttgtat gtttgacatg gaatgtcata 300 
atggtcaaca gcctgtgctg gctatcacag 360 
aagaatgtta ttctgtcttc acaaaccgga 420 
gtgccatctc tatagctgag gatgaagaat 480 
ccttcaccag tggaaaactc aaggagatgg 540 
tggtgagaaa tctgaggcgg gaagcagaga 600 
ttggggccta cagcatggat gtgatcacta 660 
tcaacaatcc acaagacccc tttgtggaaa 720 
tggatccatt ctttctctca ataacagtct 780 
taaatatctg tgtgtttcca agagaagtta 840 
tgaaagaaag tcgcctcgaa gatacacaaa 900 
ttgactctca gaattcaaaa gaaactgagt 960 
tggcccaatc aattatcttt atttttgctg 1020 
tcattatgta tgaactggcc actcaccctg 1080 
atgcagtttt acccaataag gcaccaccca 1140 
ttgacatggt ggtgaatgaa acgctcagat 1200 
tctgcaaaaa agatgttgag atcaatggga 1260 
ttccaagcta tgctcttcac cgtgacccaa 1320 
ctgaaagatt cagcaagaag aacaaggaca 1380 
gaagtggacc cagaaactgc attggcatga 1440 
taatcagagt ccttcaigaac ttctccttca 1500 
aattaagctt aggaggactt cttcaaccag 1560 
gggatggcac cgtaagtgga gcctgaattt 1620 
aatctgtgcc tgagaacacc agagacctca 1680 
agatgggctt catccaatgg actgcataaa 1740 
ctctcattgt ctgtgtagag tgttatactt 1800 
tgaggaggta gatttggctc ctctgcttct 1860 
caccattaac tcctcctgag ctctgataag 1920 
caaattatta atgaaaataa gaattatttt 1980 
tgttttctct ggagtattct ataagtttta 2040 
gtattatcag atgctttcct gcacattaag 2100 
caagtaaata tttttggtca ttgtaatcac 2160 
atttgattat taacataggt gaaagttaat 2220 
aatattcata gtttaattat gccttttttg 2280 
tagcagtttg ggaggctgag ccgggtggat 2340 
ggcctacatg gttgaaaccc catctctact 2400 
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aaaaatacac aaattagcta ggcatggtgg actcgcctgt aatctcacta cacaggaggc 2460 
tgaggcagga gaatcacttg aacctgggag gcggatgttg aagtgagctg agattgcacc 2520 
actgcactcc agtctgggtg agagtgagac tcagtcttaa aaaaatatgc ctttttgaag 2580 
cacgtacatt ttgtaacaaa gaactgaagc tcttattata ttattagttt tgatttaatg 2640 
ttttcagccc atctcctttc atatttctgg gagacagaaa acatgtttcc ctacacctct 2700 
tgcattccat cctcaacacc caactgtctc gatgcaatga acacttaata aaaaacagtc 2760 
gattggtc 2768 

<210> 298 

<211> 1358 

<212> DNA 

<213> Homo sapiens 

<400> 298 

ggcgtccgcg cgctgcacaa tggcggctct gaagagttgg ctgtcgcgca gcgtaacttc 60 
attcttcagg tacagacagt gtttgtgtgt tcctgttgtg gctaacttta agaagcggtg 120 
tttctcagaa ttgataagac catggcacaa aactgtgacg attggctttg gagtaaccct 180 
gtgtgcggtt cctattgcac agaaatcaga gcctcattcc cttagtagtg aagcattgat 240 
gaggagagca gtgtctttgg taacagatag cacctctacc tttctctctc agaccacata 300 
tgcgttgatt gaagctatta ctgaatatac taaggctgtt tataccttaa cttctcttta 360 
ccgacaatat acaagtttac ttgggaaaat gaattcagag gaggaagatg aagtgtggca 420 
ggtgatcata ggagccagag ctgagatgac ttcaaaacac caagagtact tgaagctgga 480 
aaccacttgg atgactgcag ttggtctttc agagatggca gcagaagctg catatcaaac 540 
tggcgcagat caggcctcta taaccgccag gaatcacatt cagctggtga aactgcaggt 600 
ggaagaggtg caccagctct cccggaaagc agaaaccaag ctggcagaag cacagataga 660 
agagctccgt cagaaaacac aggaggaagg ggaggagcgg gctgagtcgg agcaggaggc 720 
ctacctgcgt gaggattgag ggcctgagca cactgccctg tctccccact cagtggggaa 780 
agcaggggca gatgccaccc tgcccagggt tggcatgact gtctgtgcac cgagaagagg 840 
cggcaggtcc tgccctggcc aatcaggcga gacgcctttg tgagctgtga gtgcctcctg 900 
tggtctcagg cttgcgctgg acctggttct tagcccttgg gcactgcacc ctgtttaaca 960 
tttcacccca ctctgtacag ctgctcttac ccattttttt tacctcacac ccaaagcatt 1020 
ttgcctacct gggtcagaga gaggagtcct ttttgtcatg cccttaagtt cagcaactgt 1080 
ttaacctgtt ttcagtctta tttacgtcgt caaaaatgat ttagtacttg ttccctctgt 1140 
tgggatgcca gttgtggcag ggggagggga acctgtccag tttgtacgat ttctttgtat 1200 
gtatttctga tgtgttctct gatctgcccc cactgtcctg tgaggacagc tgaggccaag 1260 
gagtgaaaaa cctattacta ctaagagaag gggtgcagag tgtttacctg gtgctctcaa 1320 
caggacttaa catcaacagg acttaacaca gaaaaaaa 1358 

<210> 299 
<211> 4407 
<212> DNA 

<213> Homo sapiens 
<400> 299 

tttcgactcg cgctccggct gctgtcactt ggctctctgg ctggagcttg aggacgcaag 60 
gagggtttgt cactggcaga ctcgagactg taggcactgc catggcccct gtgctcagta 120 
aggactcggc ggacatcgag agtatcctgg ctttaaatcc tcgaacacaa actcatgcaa 180 
ctctgtgttc cacttcggcc aagaaattag acaagaaaca ttggaaaaga aatcctgata 240 
agaactgctt taattgtgag aagctggaga ataattttga tgacatcaag cacacgactc 300 
. ttggtgagcg aggagctctc cgagaagcaa tgagatgcct gaaatgtgca gatgccccgt 360 
gtcagaagag ctgtccaact aatcttgata ttaaatcatt catcacaagt attgcaaaca 420 
agaactatta tggagctgct aagatgatat tttctgacaa cccacttggt ctgacttgtg 480 
gaatggtatg tccaacctct gatctatgtg taggtggatg caatttatat gccactgaag 540 
agggacccat taatattggt ggattgcagc aatttgctac tgaggtattc aaagcaatga 600 
gtatcccaca gatcagaaat ccttcgctgc ctcccccaga aaaaatgtct gaagcctatt 660 
ctgcaaagat tgctcttttt ggtgctgggc ctgcaagtat aagttgtgct tcctttttgg 720 
ctcgattggg gtactctgac atcactatat ttgaaaaaca agaatatgtt ggtggtttaa 780 
gtacttctga aattcctcag ttccggctgc cgtatgatgt agtgaatttt gagattgagc 840 
taatgaagga ccttggtgta aagataattt gcggtaaaag cctttcagtg aatgaaatga 900 
ctcttagcac tttgaaagaa aaaggctaca aagctgcttt cattggaata ggtttgccag 960 
aacccaataa agatgccatc ttccaaggcc tgacgcagga ccaggggttt tatacatcca 1020 
aagacttttt gccacttgta gccaaaggca gtaaagcagg aatgtgcgcc tgtcactctc 1080 
cattgccatc gatacgggga gtcgtgattg tacttggagc tggagacact gccttcgact 1140 
gtgcaacatc tgctctacgt tgtggagctc gccgagtgtt catcgtcttc agaaaaggct 1200 
ttgttaatat aagagctgtc cctgaggaga tggagcttgc taaggaagaa aagtgtgaat 1260 
ttctgccatt cctgtcccca cggaaggtta tagtaaaagg tgggagaatt gttgctatgc 1320 
agtttgttcg gacagagcaa gatgaaactg gaaaatggaa tgaagatgaa gatcagatgg 1380 
tccatctgaa agccgatgtg gtcatcagtg cctttggttc agttctgagt gatcctaaag 1440 
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taaaagaagc 
aaactatgca 
acactacagt 
tacagtcaca 
ctattgatct 
gtcttgctag 
gatggggttt 
cccccagaat 
ttctgaatat 
aactaaaggc 
aaaatgactg 
taaatttatc 
atccagagct 
ttgccaagct 
gtggtgccaa 
atggcacacc 
ggacagcaat 
gatttcccat 
atagtggtgc 
tcgaagacta 
aagactggga 
gtatagctga 
agaaaatcat 
ttaagagaag 
aagcactgca 
tgattgatga 
gctaccaggc 
caggctgtac 
ggacaacacc 
gtgatttgtg 
atcatgatcc 
aatatgtaat 
aattagtggc 
cagttaattg 
attagtgtga 
agtgaaatta 
aaacatttta 
agctccataa 
cctaaaattg 
ttggtgttgg 
cgtgagatgg 
taaccaaatg 
ccccggaatt 
aaagtatgca 
gtgtgccaaa 
ttttgttatt 
catataaact 
tcagaaagta 
tgtttcatta 
atcaaataaa 



cttgagccct 
aactagtgaa 
ggaatcggtg 
atatggagct 
ggtggacatt 
cgcaactcca 
tgccctcacc 
catccgggga 
tgagctcatc 
tgacttccca 
gacggaactt 
atgtccacat 
ggtgcggaac 
gaccccaaat 
tggcgttaca 
ttggccagca 
cagacctatt 
tttggctact 
ttccgtcctc 
ctgcactggc 
tggacagagt 
actcatggac 
agcagaaaac 
ctgttttatc 
gtaccttgga 
agaaatgtgt 
tatacagttt 
tctgtgtctc 
ttatgaacca 
aaacagttgc 
ttgtgttcag 
ttcaaaatac 
ataaaataga 
gatgttcact 
cagtttccaa 
agcattttga 
tcaataaaaa 
agtacaaatg 
agtttgtgga 
atagaggata 
gtgtacaaac 
agatgatagg 
ctttaatagt 
atgcttcatt 
tcagtttgac 
attagataat 
catttatctc 
acaatctatg 
tgcactttta 
atatcataaa 
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ataaaattta acagatgggg tctcccagaa gtagatccag 1500 
gcatgggtat ttgcaggtgg tgatgtcgtt ggtttggcta 1560 
aatgatggaa agcaagcttc ttggtacatt cacaaatacg 1620 
tccgtttctg ccaagcctga actacccctc ttttacactc 1680 
agtgtagaaa tggccggatt gaagtttata aatccttttg 1740 
gccaccagca catcaatgat tcgaagagct tttgaagctg 1800 
aaaactttct ctcttgataa ggacattgtg acaaatgttt 1860 
accacctctg gccccatgta tggccctgga caaagctcct 1920 
agtgagaaaa cggctgcata ttggtgtcaa agtgtcactg 1980 
gacaacattg tgattgctag cattatgtgc agttacaata 2040 
gccaagaagt ctgaggattc tggagcagat gccctggagt 2100 
ggcatgggag aaagaggaat gggcctggcc tgtgggcagg 2160 
atctgccgct gggttaggca agctgttcag attccttttt 2220 
gtcactgata ttgtgagcat cgcaagagct gcaaaggaag 2280 
gccaccaaca ctgtctcagg tctgatggga ttaaaatctg 2340 
Qtggggattg caaagcgaac tacatatgga ggagtgtctg 2400 
gctttgagag ctgtgacctc cattgctcgt gctctgcctg 2460 
ggtggaattg actctgctga aagtggtctt cagtttctcc 2520 
caggtatgca gtgccattca gaatcaggat ttcactgtga 2580 
ctcaaagccc tgctttatct gaaaagcatt gaagaactac 2640 
ccagctactg tgagtcacca gaaagggaaa ccagttccac 2700 
aagaaactgc caagttttgg accttatctg gaacagcgca 2760 
aagattagac tgaaagaaca aaatgtagct ttttcaccac 2820 
cccaaaaggc ctattcctac catcaaggat gtaataggaa 2880 
acatttggtg aattgagcaa cgtagagcaa gttgtggcta 2940 
atcaactgtg gtaaatgcta catgacctgt aatgattctg 3000 
gatccagaaa cccacctgcc caccataacc gacacttgta 3060 
agtgtttgcc ctattgtcga ctgcatcaaa atggtttcca 3120 
aagagaggcg tacccttatc tgtgaatccg gtgtgttaag 3180 
tgtgaacttt catgtcacct acatatgctg atctcttaaa 3240 
ctctttccaa attaaaacaa atatacattt tctaaataaa 3300 
atttgtaagt gtaaaaaatg tctcatgtca atgaccattc 3360 
ataattcttt tctgaggata gtagttaaat aactgtgtgg 3420 
gccagttgtc ttatgtgaaa aattaacttt ttgtgtggca 3480 
attgccctat gctgtgctcc atatttgatt tctaattgta 3540 
aacaaagtac tctttaacat acaagaaaat gtatccaagg 3600 
ttacctttaa ttttaatgct gtttctaaga aaatgtagtt 3660 
aagaaagtca aaaattattt gctatggcag gataagaaag 3720 
ctttattaag taaaatcccc ttcgctgaaa ttgcttattt 3780 
gggagaatat ttactaacta aataccattc actactcatg 3840 
tcatccrctt ttaatggcat ttctctttaa actatgttcc 3900 
atagatcctg gttaccactc ttttactgtg cacatatggg 3960 
caccttcatg attatagcaa ctaatgtttg aacaaagctc 4020 
attcaagaat gaaaaatata atgttgataa tatatattaa 4080 
tactctctgt tttagtgttt atgtttaaaa gaaatatatt 4140 
atttttgtat ttctctattt tcataatcag taaatagtgt 4200 
ctcttcatgg catcttcaat atgaatctat aagtagtaaa 4260 
gcttatttct atgacaaatt caagagctag aaaaataaaa 4320 
gaaatgcata tttgccacaa aacctgtatt actgaataat 4380 
gcatttt 4407 



<210> 300 

<211> 5532 

<212> DNA 

<213> Homo sapiens 



<400> 300 

gccgcgctgc 

aggccacctc 

gcacggcccc 

gcagcgatgc 

tgcccggcga 

acgcagttgg 

gaggtggtcc 

ttaaagacca 

attcctttgg 

ttagcagtct 

aatttacagg 



gccggagtcc 
gtcggcgtcc 
ctgactccgt 
gaccctccgg 
gtcgggctct 
gcacttttga 
ttgggaattt 
tccaggaggt 
aaaacctgca 
tatctaacta 
aaatcctgca 



cgagctagcc ccggcgccgc cgccgcccag 
gcccgagtcc ccgcctcgcc gccaacgcca 
ccagtattga tcgggagagc cggagcgagc 
gacggccggg gcagcgctcc tggcgctgct 
ggaggaaaag aaagtttgcc aaggcacgag 
agatcatttt ctcagcctcc agaggatgtt 
ggaaattacc tatgtgcaga ggaattatga 
ggctggttat gtcctcattg ccctcaacac 
gatcatcaga ggaaatatgt actacgaaaa 
tgatgcaaat aaaaccggac tgaaggagct 
tggcgccgtg cggttcagca acaaccctgc 
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accggacgac 60 
caaccaccgc 120 
tcttcgggga 180. 
ggctgcgctc 240 
taacaagctc 300 
caataactgt 360 
tctttccttc 420 
agtggagcga 480 
ttcctatgcc 540 
gcccatgaga 600 
cctgtgcaac 660 
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gtggagagca tccagtggcg ggacatagtc 
gacttccaga accacctggg cagctgccaa 
'tgctggggtg caggagagga gaactgccag 
tgctccgggc gctgccgtgg caagtccccc 
ggctgcacag gcccccggga gagcgactgc 
acgtgcaaggi acacctgccc cccactcatg 
gtgaaccccg agggcaaata cagctttggt 
tatgtggtga cagatcacgg ctcgtgcgtc 
gaggaagacg gcgtccgcaa gtgtaagaag 
ggaataggta ttggtgaatt taaagactca 
ttcaaaaact gcacctccat cagtggcgat 
gactccttca cacatactcc tcctctggat 
aaggaaatca cagggttttt gctgattcag 
gcctttgaga acctagaaat catacgcggc 
gcagtcgtca gcctgaacat aacatccttg 
ggagatgtga taatttcagg aaacaaaaat 
aaactgtttg ggacctccgg tcagaaaacc 
tgcaaggcca caggccaggt ctgccatgcc 
gagcccaggg actgcgtctc ttgccggaat 
tgcaagcttc tggagggtga gccaagggag 
cacccagagt gcctgcctca ggccatgaac 
tgtatccagt gtgcccacta cattgacggc 
gtcatgggag aaaacaacac cctggtctgg 
ctgtgccatc caaactgcac ctacggatgc 
aatgggccta agatcccgtc catcgccact 
gtggtggccc tggggatcgg cctcttcatg 
ctgcggaggc tgctgcagga gagggagctt 
cccaaccaag ctctcttgag gatcttgaag 
ggctccggtg cgttcggcac ggtgtataag 
aaaattcccg tcgctatcaa ggaattaaga 
atcctcgatg aagcctacgt gatggccagc 
ggcatctgcc tcacctccac cgtgcaactc 
ctggactatg tccgggaaca caaagacaat 
gtgcagatcg caaagggcat gaactacttg 
gcagccagga acgtactggt gaaaacaccg 
gccaaactgc tgggtgcgga agagaaagaa 
aagtggatgg cattggaatc aattttacac 
agctacgggg tgaccgtttg ggagttgatg 
cctgccagcg agatctcctc catcctggag 
tgtaccatcg atgtctacat gatcatggtc 
ccaaagttcc gtgagttgat catcgaattc 
cttgtcattc agggggatga aagaatgcat 
cgtgccctga tggatgaaga agacatggac 
ccacagcagg gcttcttcag cagcccctcc 
agtgcaacca gcaacaattc caccgtggct 
cccatcaagg aagacagctt cttgcagcga 
gaggacagca tagacgacac cttcctccca 
aaaaggcccg ctggctctgt gcagaatcct 
cccagcagag acccacacta ccaggacccc 
ctcaacactg tccagcccac ctgtgtcaac 
cagaaaggca gccaccaaat tagcctggac 
aaggaagcca agccaaatgg catctttaag 
agggtcgcgc cacaaagcag tgaatttatt 
ctaaaaatcc agactctttc gatacccagg 
agccatgccc gcattagctc ttagacccac 
gccaggaagt acttccacct cgggcacatt 
tgtgaagcat ttacagaaac gcatccagca 
ctttcaaaga ggtatatttg aaaaaaaaaa 
ggggatcttg gagtttttca ttgtcgctat 
aggaagaagc ttgctggtag cacttgctac 
aggagcacaa gccacaagtc ttccagagga 
ttccactgca aaacactaaa gatccaagaa 
tactgtatca agtcatggca ggtacagtag 
gaagaaacgg aggggatgaa ttcttcctta 
acttactccc cactgatgga ccagtggttt 
tcttccattc cattgttttg aaactcagta 
gcaagagagg atgacacatc aaataataac 
catttggacc aatagcccac agctgagaat 
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agcagtgact ttctcagcaa catgtcgatg 720 
aagtgtgatc caagctgtcc caatgggagc 780 
aaactgacca aaatcatctg tgcccagcag 840 
agtgactgct gccacaacca gtgtgctgca 900 
ctggtctgcc gcaaattccg agacgaagcc 960 
ctctacaacc ccaccacgta ccagatggat 1020 
gccacctgcg tgaagaagtg tccccgtaat 1080 
cgagcctgtg gggccgacag ctatgagatg 1140 
tgcgaagggc cttgccgcaa agtgtgtaac 1200 
ctctccataa atgctacgaa tattaaacac 1260 
ctccacatcc tgccggtggc atttaggggt 1320 
ccacaggaac tggatattct gaaaaccgta 1380 
gcttggcctg aaaacaggac ggacctccat 1440 
aggaccaagc aacatggtca gttttctctt 1500 
ggattacgct ccctcaagga gataagtgat 1560 
ttgtgctatg caaatacaat aaactggaaa 1620 
aaaattataa gcaacagagg tgaaaacagc 1680 
ttgtgctccc ccgagggctg ctggggcccg 1740 
gtcagccgag gcagggaatg cgtggacaag 1800 
tttgtggaga actctgagtg catacagtgc 1860 
atcacctgca caggacgggg accagacaac 1920 
ccccactgcg tcaagacctg cccggcagga 1980 
aagtacgcag acgccggcca tgtgtgccac 2040 
actgggccag gtcttgaagg ctgtccaacg 2100 
gggatggtgg gggccctcct cttgctgctg 2160 
cgaaggcgcc acatcgttcg gaagcgcacg 2220 
gtggagcctc ttacacccag tggagaagct 2280 
gaaactgaat tcaaaaagat caaagtgctg 2340 
ggactctgga tcccagaagg tgagaaagtt 2400 
gaagcaacat ctccgaaagc caacaaggaa 2460 
gtggacaacc cccacgtgtg ccgcctgctg 2520 
atcacgcagc tcatgccctt cggctgcctc 2580 
attggctccc agtacctgct caactggtgt 2640 
gaggaccgtc gcttggtgca ccgcgacctg 2700 
cagcatgtca agatcacaga ttttgggctg 2760 
taccatgcag aaggaggcaa agtgcctatc 2820 
agaatctata cccaccagag tgatgtctgg 2880 
acctttggat ccaagccata tgacggaatc 2940 
aaaggagaac gcctccctca gccacccata 3000 
aagtgctgga tgatagacgc agatagtcgc 3060 
tccaaaatgg cccgagaccc ccagcgctac 3120 
ttgccaagtc ctacagactc caacttctac 3180 
gacgtggtgg atgccgacga gtacctcatc 3240 
acgtcacgga ctcccctcct gagctctctg 3300 
tgcattgata gaaatgggct gcaaagctgt 3360 
tacagctcag accccacagg cgccttgact 3420 
gtgcctgaat acataaacca gtccgttccc 3480 
gtctatcaca atcagcctct gaaccccgcg 3540 
cacagcactg cagtgggcaa ccccgagtat 3600 
agcacattcg acagccctgc ccactgggcc 3660 
aaccctgact accagcagga cttctttccc 3720 
ggctccacag ctgaaaatgc agaataccta 3780 
ggagcatgac cacggaggat agtatgagcc 3840 
accaagccac agcaggtcct ccatcccaac 3900 
agactggttt tgcaacgttt acaccgacta 3960 
ttgggaagtt gcattccttt gtcttcaaac 4020 
agaatattgt ccctttgagc agaaatttat 4080 
aaaaagtata tgtgaggatt tttattgatt 4140 
tgatttttac ttcaatgggc tcttccaaca 4200 
cctgagttca tccaggccca actgtgagca 4260 
tgcttgattc cagtggttct gcttcaaggc 4320 
ggccttcatg gccccagcag gccggatcgg 4380 
gataagccac tctgtccctt cctgggcaaa 4440 
gacttacttt tgtaaaaatg tccccacggt 4500 
ccagtcatga gcgttagact gacttgtttg 4560 
tgccgcccct gtcttgctgt catgaaatca 4620 
tcggattcca gcccacattg gattcatcag 4680 
gtggaatacc taaggataac accgcttttg 4740 
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ttctcgcaaa aacgtatctc ctaatttgag gctcagatga aatgcatcag gtcctttggg 4800 
gcatagatca gaagactaca aaaatgaagc tgctctgaaa tctcctttag ccatcacccc 4860 
aaccccccaa aattagtttg tgttacttat ggaagatagt tttctccttt tacttcactt 4920 
caaaagcttt ttactcaaag agtatatgtt ccctccaggt cagctgcccc caaaccccct 4980 
ccttacgctt tgtcacacaa aaagtgtctc tgccttgagt catctattca agcacttaca 5040 
gctctggcca caacagggca ttttacaggt gcgaatgaca gtagcattat gagtagtgtg 5100 
aattcaggta gtaaatatga aactagggtt tgaaattgat aatgctttca caacatttgc 5160 
agatgtttta gaaggaaaaa agttccttcc taaaataatt tctctacaat tggaagattg 5220 
gaagattcag ctagttagga gcccattttt tcctaatctg tgtgtgccct gtaacctgac 5280 
tggttaacag cagtcctttg taaacagtgt tttaaactct cctagtcaat atccacccca 5340 
. tccaatttat caaggaagaa atggttcaga aaatattttc agcctacagt tatgttcagt 5400 
cacacacaca tacaaaatgt tccttttgct tttaaagtaa tttttgactc ccagatcagt 5460 
cagagcccct acagcattgt taagaaagta tttgattttt gtctcaatga aaataaaact 5520 
atattcattt cc 5532 



<210> 301 

<211> 1528 

<212> DNA 

<213> Homo sapiens 

<400> 301 ' 

cggcgagcga gcaccttcga cgcggtccgg ggaccccctc gtcgctgtcc tcccgacgcg 60 
gacccgcgtg ccccaggcct cgcgctgccc ggccggctcc tcgtgtccca ctcccggcgc 120 

aegccctccc gcgagtcccg ggcccctccc gcgcccctct tctcggcgcg cgcgcagcat 180 

ggcgcccccg caggtcctcg. cgttcgggct tctgcttgcc gcggcgacgg cgacttttgc 240 

cgcagctcag gaagaatgtg tctgtgaaaa ctacaagctg gccgtaaact gctttgtgaa 300 
taataatcgt caatgccagt gtacttcagt tggtgcacaa aatactgtca tttgctcaaa 360 
gctggctgcc aaatgtttgg tgatgaaggc agaaatgaat ggctcaaaac ttgggagaag 420 

agcaaaacct gaaggggccc tccagaacaa tgatgggctt tatgatcctg actgcgatga 480 

gagcgggctc tttaaggcca agcagtgcaa cggcacctcc acgtgctggt gtgtgaacac 540 

tgctggggtc agaagaacag acaaggacac tgaaataacc tgctctgagc gagtgagaac 600 

ctactggatc atcattgaac taaaacacaa agcaagagaa aaaccttatg atagtaaaag 660 

tttgcggact gcacttcaga aggagatcac aacgcgttat caactggatc caaaatttat 720 

cacgagtatt ttgtatgaga ataatgttat cactattgat ctggttcaaa attcttctca 780 

aaaaactcag aatgatgtgg acatagctga tgtggcttat tattttgaaa aagatgttaa 840 

aggtgaatcc ttgtttcatt ctaagaaaat ggacctgaca gtaaatgggg aacaactgga 900 

tctggatcct ggtcaaactt taatttatta tgttgatgaa aaagcacctg aattctcaat 960 

gcagggtcta aaagctggtg ttattgctgt tattgtggtt gtggtgatag cagttgttgc 1020 

tggaattgtt gtgctggtta tttccagaaa gaagagaatg gcaaagtatg agaaggctga 1080 

gataaaggag atgggtgaga tgcataggga actcaatgca taactatata atttgaagat 1140 

tatagaagaa gggaaatagc aaatggacac aaattacaaa tgtgtgtgcg tgggacgaag 1200 

acatctttga aggtcatgag tttgttagtt taacatcata tatttgtaat agtgaaacct 1260 

gtactcaaaa tataagcagc ttgaaactgg ctttaccaat cttgaaattt gaccacaagt 1320 

gtcttatata tgcagatcta atgtaaaatc cagaacttgg actccatcgt taaaattatt 1380 

tatgtgtaac attcaaatgt gtgcattaaa tatgcttcca cagtaaaatc tgaaaaactg 1440 

atttgtgatt gaaagctgcc tttctattta cttgagtctt gtacatacat acttttttat 1500 
gagctatgaa ataaaacatt ttaaactg 1528 

<210> 302 
<211> 1856 
<212> DNA 

<213> Homo sapiens 
<400> 302 

ctgacttggc aggactgtgc aattgtcaga aggccgtggg gagtgggggc cagtgcctgc 60 
agcctgccct gcctctctca caggccctta gagcatcgcc aggtgcagag ctccacagct 120 
ctctttccca aggagtaatc agagggtgag aacgtggagc ctggtggaca ggtgaaagca 180 
ctgggatctt tctgcccaga aaggggaaag ttgcacattt atatcctaga gggaagcgac 240 
agcagtgctt ctccctgtgc tgaggtacag gagccatgtg gctagaaatc ctcctcactt 300 
cagtgctggg ctttgccatc tactggttca tctcccggga caaagaggaa actttgccac 360 
ttgaagatgg gtggtggggg ccaggcacga ggtccgcagc cagggaggac gacagcatcc 420 
gccctttcaa ggtggaaacg tcagatgagg agatccacga cttacaccag aggatcgata 480 
agttccgttt caccccacct ttggaggaca gctgcttcca ctatggcttc aactccaact 540 
acctgaagaa agtcatctcc tactggcgga atgaatttga ctggaagaag caggtggaga 600 
ttctcaacag ataccctcac ttcaagacta agattgaagg gctggacatc cacttcatcc 660 
acgtgaagcc cccccagctg cccgcaggcc ataccccgaa gcccttgctg atggtgcacg 720 
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gctggcccgg 
accatggcct 
gcttctcaga 
acaagctgat 
ccctgatctg 
acatggcttt 
ggaggtttct 
tattctacag 
ccgtaggctc 
tttccacctg 
tctccctgga 
cctcccagcg 
ggatgaaggt 
ctgaaaagtg 
ggggccactt 
t:gtcggtgct 
agtgccctcc 
gtcccctgcc 
acccccaact 

<210> 303 
<211> 6450 
<212> DNA 
<213> Homo 

<400> 303 

gagttgtgcc 

cctccagcac 

gggagcccag 

gccggcttca 

cgggacatgc 

ggtttctgag 

atgaccatga 

aacgagctgg 

gaggtgtacc 

gagttcaacg 

ggccccgggt 

aacagcgtgt 

ctgcagcccc 

gtgcgcgagg 

ggcagagaaa 

gagactcgct 

tcctgtgagg 

tgtccagcca 

cggctccgca 

ggagggagaa 

gggtctgctg 

tctaagaaga 

gatgctgagc 

tcgatgatgg 

gcgaagaggg 

tgtgcctggc 

aagctactgt 

atggtggaga 

cagggagagg 

tttctgtcca 

aagatcacag 

caccagcggc 

ggcatggagc 

ctggagatgc 

gaggagacgg 

aagtattaca 

tcccacacgg 

aattctgtct 

ggccattcat 

ccaaagggat 

aagcgtgagg 

taactctgtg 



39740-000lPCT.txt 
ctctttctac gagttttata agatcatccc actcctgact gaccccaaga 780 
gagcgatgag cacgtttttg aagtcatctg cccttccatc cctggctatg 840 
ggcatcctcc aagaaggggt tcaactcggt ggccaccgcc aggatctttt 900 
gctgcggctg ggcttccagg aattctacat tcaaggaggg gactgggggt 960 
cactaatatg gcccagctgg tgcccagcca cgtgaaaggc ctgcacttga 1020 
ggttttaagc aacttctcta ccctgaccct cctcctggga cagcgtttcg 1080 
tggcctcact gagagggatg tggagctgct gtaccccgtc aaggagaagg 1140 
cctgatgagg gagagcggct acatgcacat ccagtgcacc aagcctgaca 1200 
tgctctgaat gactctcctg tgggtctggc tgcctatatt ctagagaagt 1260 
gaccaatacg gaattccgat acctggagga tggaggcctg gaaaggaagt 1320 
cgacctgctg accaacgtca tgctctactg gacaacaggc accatcatct 1380 
cttctacaag gagaacctgg gacagggctg gatgacccag aagcatgagc 1440 
ctatgtgccc actggcttct ctgccttccc ttttgagcta ttgcacacgc 1500 
ggtgaggttc aagtacccaa agctcatctc ctattcctac atggttcgtg 1560 
tgcggccttt gaggagccgg agctgctcgc ccaggacatc cgcaagttcc 1620 
Qgagcggcaa tgacccaccc ctctcccccc gcctgccacc tccccccaca 1680 
aggcttttct tggggaagat accccttttc tgaggaatga gtttgcctcc 1740 
catgctggga gcccacgctc accccctcac ccctccaagc tcactcccca 1800 
ccgtgtggta agcaacatgg ctttgatgat aaacgacttt act eta .1856 



sapiens 



tggagtgatg tttaagccaa tgtcagggca aggcaacagt ccctggccgt 60 
ctttgtaatg catatgagct cgggagacca gtacttaaag ttggaggccc 120 
gagctggcgg agggcgttcg tcctgggagc tgcacttgct ccgtcgggtc 180 
ccggaccgca ggctcccggg gcagggccgg ggccagagct cgcgtgtcgg 240 
gctgcgtcgc ctctaacctc gggctgtgct ctttttccag gtggcccgcc 300 
ccttctgccc tgcggggaca cggtctgcac cctgcccgcg gccacggacc 360 
ccctccacac caaagcatct gggatggccc tactgcatca gatccaaggg 420 
agcccctgaa ccgtccgcag ctcaagatcc ccctggagcg gcccctgggc 480 
tggacagcag caagcccgcc gtgtacaact accccgaggg cgccgcctac 540 
ccgcggccgc cgccaacgcg caggtctacg gtcagaccgg cctcccctac 600 
ctgaggctgc ggcgttcggc tccaacggcc tggggggttt ccccccactc 660 
ctccgagccc gctgatgcta ctgcacccgc cgccgcagct gtcgcctttc 720 
acggccagca ggtgccctac tacctggaga acgagcccag cggctacacg 780 
ccggcccgcc ggcattctac aggccaaatt cagataatcg acgccagggt 840 
gattggccag taccaatgac aagggaagta tggctatgga atctgccaag 900 
actgtgcagt gtgcaatgac tatgcttcag gctaccatta tggagtctgg 960 
gctgcaaggc cttcttcaag agaagtattc aaggacataa cgactatatg 1020 
ccaaccagtg caccattgat aaaaacagga ggaagagctg ccaggcctgc 1080 
aatgctacga agtgggaatg atgaaaggtg ggatacgaaa agaccgaaga 1140 
tgttgaaaca caagcgccag agagatgatg gggagggcag gggtgaagtg 1200 
gagacatgag agctgccaac ctttggccaa gcccgctcat gatcaaacgc 1260 
acagcctggc cttgtccctg acggccgacc agatggtcag tgccttgttg 1320 
cccccatact ctattccgag tatgatccta ccagaccctt cagtgaagct 1380 
gcttactgac caacctggca gacagggagc tggttcacat gatcaactgg 1440 
tgccaggctt tgtggatttg accctccatg atcaggtcca ccttctagaa 1500 
tagagatcct gatgattggt ctcgtctggc gctccatgga gcacccagtg 1560 
ttgctcctaa cttgctcttg gacaggaacc agggaaaatg tgtagagggc 1620 
tcttcgacat gctgctggct acatcatctc ggttccgcat gatgaatctg 1680 
agtttgtgtg cctcaaatct attattttgc ttaattctgg agtgtacaca 1740 
gcaccctgaa gtctctggaa gagaaggacc atatccaccg agtcctggac 1800 
acactttgat ccacctgatg gccaaggcag gcctgaccct gcagcagcag 1860 
tggcccagct cctcctcatc ctctcccaca tcaggcacat gagtaacaaa 1920 
atctgtacag catgaagtgc aagaacgtgg tgcccctcta tgacctgctg 1980 
tggacgccca ccgcctacat gcgcccacta gccgtggagg ggcatccgtg 2040 
accaaagcca cttggccact gcgggctcta cttcatcgca ttccttgcaa 2100 
tcacggggga ggcagagggt ttccctgcca cagtctgaga gctccctggc 2160 
ttcagataat ccctgctgca ttttaccctc atcatgcacc actttagcca 2220 
cctgcataca ctccggcatg catccaacac caatggcttt ctagatgagt 2280 
ttgcttgctc agttcttagt ggcacatctt ctgtcttctg ttgggaacag 2340 
tccaaggcta aatctttgta acagctctct ttcccccttg ctatgttact 2400 
attcccgtag ctcttcacag ctgaactcag tctatgggtt ggggctcaga 2460 
catttaagct acttgtagag acccaggcct ggagagtaga cattttgcct 2520 
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ctgataagca ctttttaaat ggctctaaga ataagccaca gcaaagaatt taaagtggct 2580 
cctttaattg gtgacttgga gaaagctagg tcaagggttt attatagcac cctcttgtat 2640 
tcctatggca atgcatcctt ttatgaaagt ggtacacctt aaagctttta tatgactgta 2700 
gcagagtatc tggtgattgt caattcactt ccccctatag gaatacaagg ggccacacag 2760 
ggaaggcaga tcccctagtt ggccaagact tattttaact tgatacactg cagattcaga 2820 
gtgtcctgaa gctctgcctc tggctttccg gtcatgggtt ccagttaatt catgcctccc 2880 
atggacctat ggagagcaac aagttgatct tagttaagtc tccctatatg agggataagt 2940 
tcctgatttt tgtttttatt tttgtgttac aaaagaaagc cctccctccc tgaacttgca 3000 
gtaaggtcag cttcaggacc tgttccagtg ggcactgtac ttggatcttc ccggcgtgtg 3060 
tgtgccttac acaggggtga actgttcact gtggtgatgc atgatgaggg taaatggtag 3120 
ttgaaaggag caggggccct ggtgttgcat ttagccctgg ggcatggagc tgaacagtac 3180 
ttgtgcagga ttgttgtggc tactagagaa caagagggaa agtagggcag aaactggata 3240 
cagttctgag cacagccaga cttgctcagg tggccctgca caggctgcag ctacctagga 3300 
acattccttg cagaccccgc attgcctttg ggggtgccct gggatccctg gggtagtcca 3360 
gctcttattc atttcccagc gtggccctgg ttggaagaag cagctgtcaa gttgtagaca 3420 
gctgtgttcc tacaattggc ccagcaccct ggggcacggg agaagggtgg ggaccgttgc 3480 
tgtcactact caggctgact ggggcctggt cagattacgt atgcccttgg tggtttagag 3540 
ataatccaaa atcagggttt ggtttgggga agaaaatcct cccccttcct cccccgcccc 3600 
gttccctacc gcctccactc ctgccagctc atttccttca atttcctttg acctataggc 3660 
taaaaaagaa aggctcattc cagccacagg gcagccttcc ctgggccttt gcttctctag 3720 
cacaattatg ggttacttcc tttttcttaa caaaaaagaa tgtttgattt cctctgggtg 3780 
accttattgt ctgtaattga aaccctattg agaggtgatg tctgtgttag ccaatgaccc 3840 
aggtagctgc tcgggcttct cttggtatgt cttgtttgga aaagtggatt tcattcattt 3900 
ctgattgtcc agttaagtga tcaccaaagg actgagaatc tgggagggca aaaaaaaaaa 3960 
aaaaagtttt tatgtgcact taaatttggg . gacaatttta tgtatctgtg ttaaggatat 4020 
gcttaagaac ataattcttt tgttgctgtt tgtttaagaa gcaccttagt ttgtttaaga 4080 
agcaccttat atagtataat atatattttt ttgaaattac attgcttgtt tatcagacaa 4140 
ttgaatgtag taattctgtt ctggatttaa tttgactg'gg ttaacatgca aaaaccaagg 4200 
aaaaatattt agtttttttt tttttttttg tatacttttc aagctacctt gtcatgtata 4260 
cagtcattta tgcctaaagc ctggtgatta ttcatttaaa tgaagatcac atttcatatc 4320 
aacttttgta tccacagtag acaaaatagc actaatccag atgcctattg ttggatattg 4380 
aatgacagac aatcttatgt agcaaagatt atgcctgaaa aggaaaatta ttcagggcag 4440 
ctaattttgc ttttaccaaa atatcagtag taatattttt ggacagtagc taatgggtca 4500 
gtgggttctt tttaatgttt atacttagat tttcttttaa aaaaattaaa ataaaacaaa 4560 
aaaaatttct aggactagac gatgtaatac cagctaaagc caaacaatta tacagtggaa 4620 
ggttttacat tattcatcca atgtgtttct attcatgtta agatactact acatttgaag 4680 
tgggcagaga acatcagatg attgaaatgt tcgcccaggg gtctccagca actttggaaa 4740 
tctctttgta tttttacttg aagtgccact aatggacagc agatattttc tggctgatgt 4800 
tggtattggg tgtaggaaca tgatttaaaa aaaaaactct tgcctctgct ttcccccact 4860 
ctgaggcaag ttaaaatgta aaagatgtga tttatctggg gggctcaggt atggtgggga 4920 
agtggattca ggaatctggg gaatggcaaa tatattaaga agagtattga aagtatttgg 4980 
aggaaaatgg ttaattctgg gtgtgcacca aggttcagta gagtccactt ctgccctgga 5040 
gaccacaaat caactagctc catttacagc catttctaaa atggcagctt cagttctaga 5100 
gaagaaagaa caacatcagc agtaaagtcc atggaatagc tagtggtctg tgtttctttt 5160 
cgccattgcc tagcttgccg taatgattct ataatgccat catgcagcaa ttatgagagg 5220 
ctaggtcatc caaagagaag accctatcaa tgtaggttgc aaaatctaac ccctaaggaa 5280 
gtgcagtctt tgatttgatt tccctagtaa ccttgcagat atgtttaacc aagccatagc 5340 
ccatgccttt tgagggctga acaaataagg gacttactga taatttactt ttgatcacat 5400 
taaggtgttc tcaccttgaa atcttataca ctgaaatggc cattgattta ggccactggc 5460 
ttagagtact ccttcccctg catgacactg attacaaata ctttcctatt catactttcc 5520 
aattatgaga tggactgtgg gtactgggag tgatcactaa caccatagta atgtctaata 5580 
ttcacaggca gatctgcttg gggaagctag ttatgtgaaa ggcaaataaa gtcatacagt 5640 
agctcaaaag gcaaccataa ttctctttgg tgcaagtctt gggagcgtga tctagattac 5700 
actgcaccat tcccaagtta atcccctgaa aacttactct caactggagc aaatgaactt 5760 
tggtcccaaa tatccatctt ttcagtagcg ttaattatgc tctgtttcca actgcatttc 5820 
ctttccaatt gaattaaagt gtggcctcgt ttttagtcat ttaaaattgt tttctaagta 5880 
attgctgcct ctattatggc acttcaattt tgcactgtct tttgagattc aagaaaaatt 5940 
tctattcatt tttttgcatc caattgtgcc tgaactttta aaatatgtaa atgctgccat 6000 
gttccaaacc catcgtcagt gtgtgtgttt agagctgtgc accctagaaa caacatactt 6060 
gtcccatgag caggtgcctg agacacagac ccctttgcat tcacagagag gtcattggtt 6120 
atagagactt gaattaataa gtgacattat gccagtttct gttctctcac aggtgataaa 6180 
caatgctttt tgtgcactac atactcttca gtgtagagct cttgttttat gggaaaaggc 6240 
tcaaatgcca aattgtgttt gatggattaa tatgcccttt tgccgatgca tactattact 6300 
gatgtgactc ggttttgtcg cagctttgct ttgtttaatg aaacacactt gtaaacctct 6360 
tttgcacttt gaaaaagaat. ccagcgggat gctcgagcac ctgtaaacaa ttttctcaac 6420 
ctatttgatg ttcaaataaa gaattaaact 6450 
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<210> 304 
<211> 3336 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> CO) ■ . . (0) 

<223> n = A, T, c or G 



<400> 304 
cggcggcgac 
agattcataa 
cttcctgttc 
caacaggagt 
tttccagctg 
atccccaaca 
agtggcagta 
cctccaggac 
ctggagacct 
gccctttgcg 
aacaatagcc 
cgcagcatca 
aaggttgagg 
ccctactctt 
actttgaaag 
aagccaggct 
gagacgtctg 
ttgacattgg 
ttggaatcac 
gaactccccc 
ctggtaccta 
ccattgcccc 
cgcattgccc 
ccagggaaag 
actatcaagg 
aaagtggaga 
tcatctcact 
stgtgggctta 
aggagggaga 
ccggagctgc 
ccagcagact 
tttaagacac 
cccagaaccc 
agcccagtac 
gatctcagca 
agttcagaac 
gacgtcccca 
ctgacagaag 
atcagctttc 
tttattcctg 
ccgggcactc 
gttctgctcc 
cactgctggg 
ctgaagggtg 
tccattctct 
tagtataaat 
tatccagaga 
ttgcttttag 
acctggatct 
ctttttgccc 
. tcctggttaa 
tgcaactgaa 
tccccaatca 
cgaaagggcc 
ccctaggccc 
aaagcgaagg 



tgcagtctgg 

tgaaagctag 

aaaatgcccc 

ctaatcaagc 

ggatcaagat 

atgctaatat 

gtgggcccaa 

tccggcctca 

tgggaccaaa 

agcagaaacg 

tatccaacat 

agcaagagat 

agccttcgag 

acatggccat 

acatctatac 

ggaagaactc 

ccaatggcaa 

accaggtgtt 

agcagaaacg 

tgggcgcacg 

tccagttccc 

tggcggcttc 

ccaaggtgct 

aggagaaact 

aggaagaaat 

gccctccctt 

cctgggagga 

ggtccccaac 

ggagccggtc 

tcttctcaga 

cctctgaccc 

ccattaagga 

ctgaatcctg 

aaacctccca 

ccactccctt 

ccttagacct 

agccaggctc 

gcctggtcct 

ctggcctgga 

agctacagta 

caaggctcag 

tcatagctcc 

accttgtgtt 

ggaacaacaa 

gcccagcagt 

tctccaaatt 

ctgccagaag 

ttttgataga 

tgggttcttc 

ctccctgcca 

aaaagtcttt 

gcagagtgtg 

taccagggag 

cctgacctgc 

cagctgaccg 

tggaccnaaa 



agggtccaca 
cccccgtcgg 
aagtgaaaca 
agaggcctcc 
tattaaccac 
tcacagcatc 
caaattcatc 
aacccaaacc 
acctgcagct 
ggagacctgt 
ccagtggctt 
ggaggaaaag 
accatcagcg 
gatacaattc 
gtggattgag 
catccgccac 
ggtctccttc 
taagccactg 
accgaatcca 
gcggaagatg 
ggtgaaccag 
cctcatgagc 
gctagctgag 
cctgtttgga 
ccagcctggg 
ggaagagtgg 
ttcgtcccaa 
ccggtgtgtc 
tcggaggaaa 
ggggcccagt 
tgcctcccag 
aacgctgccc 
gaggctcacg 
gggtgcctct 
gcaaagtgct 
catctccgtc 
cccggagcca 
ggacacaatg 
cgaggaccca 
gagccctgcc 
tgcaccccaa 
ctgctgcctg 
ccccaagagt 
aggcaatggt 
ctcttacctt 
atcctctaat 
gtgggtagga 
agggaagacc 
actgcaggga 
cctccccgtg 
tgtattgggt 
ggtgcccaga 
actggcattg 
ctggcttcct 
catgggtgtg 
aaaaaaaaaa 



cttgtgattc tcaatggaga gtgaaaacgc 60 
ccactgattc tcaaaagacg gaggctgccc 120 
tcagaggagg aacctaagag atcccctgcc 180 
aaggaagtgg cggagtccaa ctcttgcaag 240 
cccaccatgc ccaacacgca agtagtggcc 300 
atcacagcac tgactgccaa gggaaaagag 360 
ctcatcagct gtgggggagc cccaactcag 420 . 
agctatgatg ccaaaaggac agaagtgacc 480 
agggatgtga atcttcctag accacctgga 540 
gcagatggtg aggcagcagg ctgcactatc 600 
cgaaagatga gttctgatgg actgggctcc 660 
gagaattgtc acctggagca gcgacaggtt 720 
tcctggcaga actctgtgtc tgagcggcca 780 
gccatcaaca gcactgagag gaagcgcatg 840 
gaccactttc cctactttaa gcacattgcc 900 
aacctttccc tgcacgacat gtttgtccgg 960 
tggaccattc accccagtgc caaccgctac 1020 
gacccagggt ctccacaatt gcccgagcac 1080 
gagctccgcc ggaacatgac catcaaaacc 1140 
aagccactgc taccacgggt cagctcatac 1200 
tcactggtgt tgcagccctc ggtgaaggtg 1260 
tcagagcttg cccgccatag caagcgagtc 1320 
gaggggatag ctcctctttc ttctgcagga 1380 
gaagggtttt ctcctttgct tccagttcag 1440 
gaggaaatgc cacacttagc gagacccatc 1500 
ccctccccgg ccccatcttt caaagaggaa 1560 
tctcccaccc caagacccaa gaagtcctac 1620 
tcggaaatgc ttgtgattca acacagggag 1680 
cagcatctac tgcctccctg tgtggatgag 1740 
acttcccgct gggccgcaga gctcccgttc 1800 
ctcagctact cccaggaagt gggaggacct 1860 
atctcctcca ccccgagcaa atctgtcctc 1920 
cccccagcca aagtaggggg actggatttc 1980 
gaccccttgc ctgaccccct ggggctgatg 2040 
cccccccttg aatcaccgca aaggctcctc 2100 
ccctttggca actcttctcc ctcagatata 2160 
caggtttctg gccttgcagc caatcgttct 2220 
aatgacagcc tcagcaagat cctgctggac 2280 
ctgggccctg acaacatcaa ctggtcccag 2340 
cttgcccctg tgctcaagct gtccaccatc 2400 
gcctctgagt gaggacagca ggcagggact 2460 
attatgcaaa agtagcagtc acaccctagc 2520 
atctgattcc tctgctgtcc ctgccaggag 2580 
gaaaagagat taggaacccc ccagcctgtt 2640 
ccctgatctt tgcagggtgg tccgtgtaaa 2700 
tataaatgta agcttatttc cttagatcat 2760 
tgacctgggg tttcaattga cttctgttcc 2820 
tgcagtgcac ggtttcttcc aggctgaggt 2880 
cccagacaag tggatctgct tgccagagtc 2940 
tttccaagtc agctttcctg caagaagaaa 3000 
caggagttga atttggggtg ggaggatgga 3060 
tgtgcgctat tagatgtttc tctgataatg 3120 
acgagaactc aggtggaggc ttgagaaggc 3180 
tagcttgccc ctcagctttg caaagagcca 3240 
agccagcttg agaacactaa ctactcaata 3300 
aaaaaa 3336 
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39740-OOOlPCT, txt 



<210> 305 

<211> 2365 

<212> DNA 

<213> Homo sapiens 



<400> 305 

tcccagcctt 

ggcaggagcc 

ccggcgagag 

ggtgagccac 

cctcagccac 

ttttaacatc 

cacggtgcag 

tcatggatcc 

cccctggaat 

ctccgtctac 

cttcaccttc 

aggctcggcc 

gcccgacagc 

agcctcctcg 

ctccggactc 

gtccaggccc 

gaccccactg 

tcacaaaatg 

caggagagca 

gaatgccaat 

taacagaccc 

caaatccaaa 

gtttaacccg 

ccactccagc 

tggaccacac 

acagggcccc 

atcatgaagc 

gccactttgc 

tctgtgaata 

aaaaaaaaat 

tattgcatct 

aaatagtgtg 

ttgtttcctt 

aagagaaaag 

cctctgcaaa 

agggtttcag 

tgtttgtatg 

ttatacagac 

tcgtttgttt 

cattcagttg 



cccatccccc 
ccccgacctc 
ggcgcgacga 
caccaccccg 
tcctacatgg 
gacggtcaag 
aggtaccctc 
ctaccctggc 
ctcagcccct 
cccccggcct 
ccgcccaccc 
ggctcggccc 
atgaagctgg 
tcgacccacc 
ttccccccca 
aaggcccggt 
tggcggcgag 
aacggacaga 
gggacgtcct 
ggggaccctg 

ctgactatga 
aagtgcaaaa 
gccgccctct 
cacatgctga 
cacccctcca 
cagcgagagt 
ctaaacgcga 
aaaggagctc 
agccattctg 
cctgaacatt 
gggtagctgt 
gaaattaaga 
tcactggcca 
aaaaaaaaag 
ggaaatacca 
agagcctttt 
tataattcaa 
cgaactgttg 
gtttcaatat 
gcaaaaaaaa 



caccgaaagc 
ccaggcggac 
cagccgaggc 
ccgtgctcaa 
acgcggcgca 
gcaaccacgt 
cgacccacca 
tggacggcgg 
tctccaagac 
cgtcctcctc 
cgccgaagga 
ggcaggacga 
agtcgtccca 
accccatcac 
gcagcctgct 
ccagcacagg 
atggcacggg 
accggcccct 
gtgcgaactg 
tctgcaatgc 
agaaggaagg 
aagtgc^tga 
ccagacacat 
ccacgcccac 
gcatggtcac 
ccctgcagtc 
tggatatatg 
actgtggtgt 
actcatatcc 
gcatataact 
aaggcatgaa 
agaaactagg 
cagttgtttg 
aaaaaagttg 
gttctgggca 
tctaggccta 
agcaccaaaa 
tataaattta 
tttccttctc 
aaaaa 



aaatcattca 
cgcccttccc 
catggaggtg 
cgggcagcac 
gtacccgctg 
cccgccctac 
cgggagccag 
caaagccctg 
gtccatccac 
cttgtcgggg 
cgtctccccg 
gaaagagtgc 
ctcccgtggc 
cacctacccg 

gggcggctcc 

cagggagtgt 
acactacctg 
cattaagccc 
tcagaccacc 
ctgtgggctc 
catccagacc 
ctcactggag 
gtcctccctg 
gccgatgcac 
cgccatgggt 
cctttcgact 
tttttgaagg 
ctgtgttcca 
cctatttaac 
tatattgtaa 
ggatgccaag 
tctgatattc 
atgcattaaa 
taggcgaatc 
atcagtgtta 
catgctttgt 
taagaaaaga 
tttactgcta 
tctcaatttt 



acgacccccg 
tccccgcgcg 
acggcggacc 
ccggacacgc 
ccggaggagg 
tacggaaact 
gtgtgccgcc 
ggcagccacc 
cacggctccc 
ggccacgcca 
gacccatcgc 
ctcaagtacc 
agcatgaccg 
ccctacgtgc 
cccaccggct 
gtgaactgtg 
tgcaacgcct 
aagcgaaggc 
acaaccacac 
tactacaagc 
agaaaccgaa 
gacttcccca 
agccacatct 
ccgccatcca 
tagagccctg 
tgcatttttg 
cagaaagcaa 
accactgaat 
agggtctcta 
gaaatactgt 
aagtttaagg 
aaatggacaa 
agaaaataaa 
atttgttcaa 
ccgttcacca 
gaacaagtcc 
tgtagattta 
gtcttaagaa 
cggttgaata 



accctccgac 
ggttccgggc 
agccgcgctg 
accacccggg 
tggatgtgct 
cggtcagggc 
cgcctctgct 
acaccgcctc 
cggggcccct 
gcccgcacct 
tgtccacccc 
aggtgcccct 
ccctgggtgg 
ccgagtacag 
tcggatgcaa 
gggcaacctc 
gcgggctcta 
tgtctgcagc 
tctggaggag 
ttcacaatat 
aaatgtctag 
agaacagctc 
cgcccttcag 
gcctgtcctt 
ctcgatgctc 
caggagcagt 
aattatgttt 
ctggacccca 
gtgctgtgaa 
acaatgactt 
aatatgggag 
actgccagtt 
aaaaagaaaa 
agctgttggc 
gttgccattg 
ctgtaattgt 
tttcatcata 
ctgctttctt 
aactagatta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2365 



<210> 306 

<211> 1117 

<212> DNA 

<213> Homo sapiens 



<400> 306 

gcaccaacca 

gccatccgcc 

ggggacgctc 

gactttccca 

atcttgtgct 

cgtgtggaca 

tacaatccag 

aagctctact 

gtagattttc 

gacgccttcc 

gcctacatga 

ggcaacaagt 

gcccaggctg 

tcctgtttat 



gcaccatgcc 
tgctcctgga 
ctgattatga 
atctgcccta 
acattgcccg 
ttttggagaa 
aatttgagaa 
cagagtttct 
tcgtctatga 
caaatctgaa 
agtccagccg 
agggccttga 
tgcagcgcag 
tcccatcttt 



catgatactg 
atacacagac 
cagaagccag 
cttgattgat 
caagcacaac 
ccagaccatg 
actgaagcca 
ggggaagcgg 
tgtccttgac 
ggacttcatc 
cttcctccca 
aggcaggagg 
ctggactctg 
actcccaaga 



gggtactggg acatccgcgg 
tcaagctatg aggaaaagaa 
tggctgaatg aaaaattcaa 
ggggctcaca agatcaccca 
ctgtgtgggg agacagaaga 
gacaaecata tgcagctggg 
aagtacttgg aggaactccc 
ccatggtttg caggaaacaa 
ctccaccgta tatttgagcc 
tcccgctttg agggcttgga 
agacctgtgt tctcaaagat 
tgggagtgag gagcccatac 
catcccagca cctgcctcct 
cttcattgtc cctcttcact 
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gctggcccac 60 
gtacacgatg 120 
gctgggcctg 180 
gagcaacgcc 240 
ggagaagatt 300 
catgatctgc 360 
tgaaaagcta 420 
gatcactttt 480 
caactgcttg 540 
gaagatctct 600 
ggctgtctgg 660 
tcagcctgct 720 
cgttcctttc 780 
ccccctaaac 840 



wo 2003/078662 



PCT/US2003/007713 



ccctgtccca 
tcccatcatt 
tgtctgcttt 
tgttgcatga 
gcctaggcct 



tgcaggccct 
acccttccct 
aaagcctgcc 
aggagcagca 
acctgatgga 



39740-000lPCT.txt 
ttgaagcctc agctacccac tatccttcgt gaacatcccc 900 
gcactaaagc cagcctgacc ttccttcctg ttagtggttg 960 
tggcccctcg cctgtggagc tcagccccga gctgtccccg 1020 
ttgactggtt tacaggccct gctcctgcag catggtccct 1080 
agtaaagcct caaccac 1117 



<210> 307 

<211> 1266 

<212> DNA 

<213> Homo sapiens 



<400> 307 

ctcggaagcc 

gtgggctggc 

aacggtacac 

tcaagctaga 

cccagagcaa 

aagaagaaaa 

tgataaggct 

tacctggaca 

aaaagctcac 

accccaagtg 

tggagaaaat 

agatggccca 

ttttgtttca 

cacttatgtt 

ggatatgtgg 

actccccaac 

aaaaaacgag 

aagactgggt 

gaaaccgtcc 

ggctagccaa 

ttgacactat 

attgat 



cgtcaccatg 
gcacgccatc 
gtgcggggaa 
cctggacttt 
tgccatcttg 
gattcgagtg 
ctgttacagc 
actgaaacaa 
ctttgtggat 
cctggatgag 
cgctgcctac 
gtggggcaac 
tcctgtccgt 
actggtgtcc 
agaatcatca 
tgagttcaag 
attgcacagt 
gaggataaca 
tgtgtacaca 
tagagttggc 
tcactaaaat 



tcgtgcgagt 
cgcctgctcc 
gctcctgact 
cctaatctgc 
cgctacatcg 
gacatcatag 
tctgaccacg 
ttctccatgt 
tttctcacct 
ttcccaaacc 
ttacagtctg 
aagcctgtat 
aaggggtcag 
agctgagttt 
agatatgaat 
ggctgtaggt 
tggagagagc 
caagtaaaac 
tgggagttta 
aattgcttat 
taatctggaa 



cgtctatggt 
tggagttcac 
atgatcgaag 
cctacctcct 
ctcgcaagca 
agaaccaagt 
aaaaactgaa 
ttctgtggaa 
atgatatctt 
tgaaggcttt 
atcagttctg 
gctgagcagg 
cgctcttgct 
ctcttgggta 
tgaatcgctg 
tcatgcccaa 
aggtgtgtta 
tgtggtactg 
gtgtgataaa 
tgaaactcat 
tttaaggccc 



tctcgggtac 
ggatacctct 
Gcaatggctg 
ggatgggaag 
caacatgtgt 
aatggatttc 
gcctcagtac 
attctcatgg 
ggatcagaac 
catgtgccgt 
caagatgccc 
aggcagactt 
ttgctctttt 
taaaggctaa 
cgatactgtg 
gccctgagag 
aatggactgg 
atggacttaa 
ggcagtattt 
taaaaataat 
aacattaaac 



tgggatattc 
tatgaggaga 
gatgtgaaat 
aacaagatca 
ggtgagactg 
cgcacacaac 
ttggaagagc 
tttgccgggg 
cgtatatttg 
tttgaggctt 
atcaacaaca 
gcagagcttg 
caatgaatag 
aagggaaaaa 
gcatttccct 
tgggtactag 
agtcccngtg 
ccggagttcg 
cagactggtg 
agagccccac 
acaaagctgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1266 



<210> 308 

<211> 2162 

<212> DNA 

<213> Homo sapiens 



<400> 308 

gggctgcgct 

ggggcccgca 

tcagcaggcc 

gacaaccagt 

gggtctggga 

ccgcctggac 

ggcgtcagcc 

caatgtcatg 

cctgcatcag 

ggtgtttctg 

catcaccatg 

gtttggtctg 

cctagcaggc 

atcagaaagt 

ctgttacctg 

aggacccggg 

aggcaaagag 

tatcaaagcc 

tatcaccatt 

cagcacctgg 

ctggttgggc 

gccaagcctg 

gccccgccgc 

tgcctttgcc 

agtgaagcca 

tctggcactg 

cagaaggact 



gtccagctgt 
ggcctgggaa 
cctgagggag 
caccagcctc 
acgctgctcc 
atgtcccaga 
gcccctgcag 
accctatgtg 
aggatccccc 
atcactgcca 
atcaagatcg 
gctggccttc 
ttctttgcct 
gccttcggct 
ggcctgcccc 
gagcaggaga 
gaatctggag 
atcctgaaaa 
gggatgtttc 
gaacgttact 
cggagcctca 
gtgctggccc 
tacctgactg 
ttctccaacg 
gctgaggcag 
ggggctgttt 
gcctgcctcc 



ggctatggcc 
tttccgtccc 
ggagctgtca 
aggacagata 
cgtggaattt 
atgtgtcctt 
cacccttgcc 
ccatgctgcc 
agtccgtacg 
tcctggtgaa 
tgctcattaa 
tgcctgccag 
ccgtggccat 
actttatcac 
gcctggaatt 
ccaagttgga 
tttcagtctc 
atatctcagt 
cagccgtgac 
tcattcctgt 
cagctgtatt 
ggctggtgtt 
tggtcttcga 
gctacctcgc 
agaccgcagg 
tctccttcct 
ctccctgtct 



ccagccccga 
ccaccaagtc 
gccagggaaa 
caaagctgtc 
tttcatgacg 
ggtcactgct 
tgagcggaac 
cctgctgtta 
gatcctgggc 
ggtgcagctg 
ttcatttggt 
ctacacggcc 
gatctgcgct 
agcctgtgct 
ctaccgctac 
cctcattagc 
caactctcag 
cctggctttc 
tgttgaggtc 
gtcctgtttc 
catgtggcct 
tgtgccactg 
gcacgatgcc 
cagcctctgc 
agccatcatg 
gttccgggca 
gcctcctgcc 
Page .5 



gatgaggagg 
cggatgctca 
accgagaaca 
tggcttatct 
gccactcagt 
gaactgagca 
tctctcagtg 
ttcacctacc 
agcctggtgg 
gatgctctgc 
gccatcctgc 
cccatcatga 
attgccagtg 
gttatcattt 
taccagcagc 
aaaggagagg 
cccaccaatg 
tctgtctgct 
aagtccagca 
ttgactttca 
gggaaggaca 
ctgctgctgt 
tggttcatct 
atgtgcttcg 
gccttcttcc 
attgtgtgac 
ccttccttct 



gagagaacta 
ctccaaagtc 
ccatcaccat 
tcttcatgct 
atttcacaaa 
aggacgccca 
ccatcttcaa 
tcaactcctt 
ccatcctgct 
ccttctttgt 
agggcagcct 
gtggccaggg 
gctcggaact 
tgaccatcat 
tcaagcttga 
agccaagagc 
aaagccactc 
tcatcttcac 
tcgcaggcag 
atatctttga 
gccgctggct 
gcaacattaa 
tcttcatggc 
ggcccaagaa 
tgtgtctggg 
aaaggatgga 
gccaggggtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 



wo 2003/078662 



PCT/US2003/007713 



atcctgagtg 
gggcccggat 
gtttggggct 
tcttggctct 
acgtgtgtct 
gtcctggggt 
tctcccctgc 
agtttttacc 
tttgacagaa 
tt 



gtctggcggt 
ctccaggccc 
cagagtcgag 
gactgatccc 
ctgtgtatgt 
ggctaggagc 
gcctcctcct 
catcatgcac 
accaggtgcc 



39740-000lPCT-t:xt 
tttttcttct aactgacttc tgctttccac 
tggggaggga gcctctggac ggacagtggg 
ggacggggtg tagcctcggc atttgcttga 
tgcttgtgca ggccagtgga ggctcttggg 
gtctgtgtgt ctgcgtccgt gtctgtcaga 
tgggtctgac cgttgtatgg tttgacctga 
ctgtgttttt tccatgtccc cctcccaact 
cctgtacagt tgccacgtta ctgccttttt 
ttcagaggct ctctgattta aataaacctt 



ggcgtgtgct 
gacattgtgg 
gtttctccac 
cttggagaac 
ctgtctgcct 
tatactccat 
ccccatgccc 
taaaaatata 
tcttgttttt 



<210> 309 

<211> 3933 

<212> DNA 

<213> Homo sapiens 



<400> 309 

cacgaggcag 

acctgaggag 

gtctcacgag 

tccgcgtgtg 

agacatcgcg 

ataagttctg 

gaatctgaag 

catcttgata 

ctggatgctg 

ttgaaagcct 

tctgataatg 

tttgatttta 

ggccttgtga 

tgtaccctaa 

cactgcacag 

aagaaaccac 

attgaaattc 

tcttattgtg 

cgctcaattt 

gatatgttta 

ggatatgtct 

cagtgcattg 

tcccttcaac 

cagctattca 

gaacctgatg 

tttggcagca 

gatgtaatgc 

ttacccaccg 

gaagttgcat 

cagattcagg 

cctaatagtc 

ttggaattgg 

caggacacag 

cagttacgtt 

gcaagtcctc 

aatgccacca 

gaagacatta 

agtgccacat 

ggaaaaggag 

gtcgctttga 

ttgcagaatg 

ggaattggaa 

aaacgtgtaa 

ttaataccct 

ccacagctga 

ctgcagggtg 

cattcctttt 

atctaatttt 

ccctttctac 

agacagctca 

taaaaaatgc 



cactctcttc 
aggctcggag 
gggtttcccg 
gagggagcca 
gggaccgatt 
aacgtcgaaa 
ttttttatga 
aggcctctgt 
gtgatttgga 
tggatggttt 
tgaacaaata 
ctcatccatg 
aaaagggtaa 
ctagccgagg 
gccacattca 
ctatgacctg 
ctttagatag 
atgaaagaat 
atgaatatta 
ctaaaggaca 
gggttgaaac 
tatgtgtgaa 
aaacagaatg 
ccaaagttga 
ctttaacttt 
acgacacaga 
tcccctcacc 
ctgaaacgcc 
taaaattaga 
atcagacacc 
ccagtgaata 
tagaaaaact 
atttagactt 
ccttcgatca 
aaagcacagt 
ctaccactgc 
aaatattgat 
catcaccata 
tcatagaaca 
gtcaaagaac 
ctcagagaaa 
cattattaca 
aaggatgcaa 
ctgatttagc 
ccagttatga 
aagaattact 
tttggacact 
agaagcctgg 
ttaatttaca 
ttttctcagt 
acctttttat 



gtcgcttcgg ccagtgtgtc gggctgggcc 
ccgggcccgg accccggcga ttgccgcccg 
cctcgcaccc ccacctctgg acttgccttt 
gcgcttaggc cggagcgagc ctgggggccg 
caccatggag ggcgccggcg gcgcgaacga 
agaaaagtct cgagatgcag ccagatctcg 
gcttgctcat cagttgccac ttccacataa 
gatgaggctt accatcagct atttgcgtgt 
tattgaagat gacatgaaag cacagatgaa 
tgttatggtt ctcacagatg atggtgacat 
catgggatta actcagtttg aactaactgg 
tgaccatgag gaaatgagag aaatgcttac 
agaacaaaac acacagcgaa gcttttttct 
aagaactatg aacataaagt ctgcaacatg 
cgtatatgat accaacagta accaacctca 
cttggtgctg atttgtgaac ccattcctca 
caagactttc ctcagtcgac acagcctgga 
taccgaattg atgggatatg agccagaaga 
tcatgctttg gactctgatc atctgaccaa 
agtcaccaca ggacagtaca ggatgcttgc 
tcaagcaact gtcatatata acaccaagaa 
ttacgttgtg agtggtatta ttcagcacga 
tgtccttaaa ccggttgaat cttcagatat 
atcagaagat acaagtagcc tctttgacaa 
gctggcccca gccgctggag acacaatcat 
aactgatgac cagcaacttg aggaagtacc 
caacgaaaaa ttacagaata taaatttggc 
aaagccactt cgaagtagtg ctgaccctgc 
accaaatcca gagtcactgg aactttcttt 
tagtccttcc gatggaagca ctagacaaag 
ttgtttttat gtggatagtg atatggtcaa 
ttttgctgaa gacacagaag caaagaaccc 
ggagatgtta gctccctata tcccaatgga 
gttgtcacca ttagaaagca gttccgcaag 
tacagtattc cagcagactc aaatacaaga 
caccactgat gaattaaaaa cagtgacaaa 
tgcatctcca tctcctaccc acatacataa 
tagagatact caaagtcgga cagcctcacc 
gacagaaaaa tctcatccaa gaagccctaa 
tacagttcct gaggaagaac taaatccaaa 
gcgaaaaatg gaacatgatg gttcactttt 
gcagccagac gatcatgcag ctactacatc 
atctagtgaa cagaatggaa tggagcaaaa 
atgtagactg ctggggcaat caatggatga 
ttgtgaagtt aatgctccta tacaaggcag 
cagagctttg gatcaagtta actgagcttt 
ggtggctcac tacctaaagc agtctattta 
ctacaatact gcacaaactt ggttagttca 
ttaatgctct tttttagtat gttctttaat 
tttttggtat ttaaaccatt gcattgcagt 
ttatttattt ttggctaggg agtttatccc 
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ctgacaagcc 
cttctctcta 
ccttctcttc 
cccgccgtga 
caagaaaaag 
gcgaagtaaa 
tgtgagttcg 
gaggaaactt 
ttgcttttat 
gatttacatt 
acacagtgtg 
acacagaaat 
cagaatgaag 
gaaggtattg 
gtgtgggtat 
cccatcaaat 
tatgaaattt 
acttttaggc 
aactcatcat 
caaaagaggt 
ttctcaacca 
cttgattttc 
gaaaatgact 
acttaagaag 
atctttagat 
attatataat 
aatgtctcca 
actcaatcaa 
taccatgccc 
ttcacctgag 
tgaattcaag 
attttctact 
tgatgacttc 
ccctgaaagc 
acctactgct 
agaccgtatg 
agaaactact 
aaacagagca 
cgtgttatct 
gatactagct 
tcaagcagta 
actttcttgg 
gacaattatt 
aagtggatta 
cagaaaccta 
ttcttaattt 
tattttctac 
atttttgatc 
gctggatcac 
agcatcattt 
tttttcgaat 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2162 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 



wo 2003/078662 



PCT/US2003/007713 



39740-OOOlPCT- txt 
tatttttaag aagatgccaa tataattttt gtaagaaggc agtaaccttt 
ataggcagtt gaaaaatttt tacacctttt ttttcacatt ttacataaat 
tgccagcagt acgtggtagc cacaattgca caatatattt tcttaaaaaa 
tactcatgga atatattctg cgtttataaa actagttttt aagaagaaat 
ctatgaaatt gttaaacctg gaacatgaca ttgttaatca tataataatg 
gctgtatggt ttattattta aatgggtaaa gccatttaca taatatagaa 
atatctagaa ggtatgtggc atttatttgg ataaaattct caattcagag 
atgtttctat agtcactttg ccagctcaaa agaaaacaat accctatgta 
tttatgctaa tattgtgtaa ctgatattaa acctaaatgt tctgcctacc 
aaagatattt tgagcagact gtaaacaaga aaaaaaaaat catgcattct 
gcctagtatg ttaatttgct caaaatacaa tgtttgattt tatgcacttt 
acatcctttt tttcatgtag atttcaataa ttgagtaatt ttagaagcat 
atatatagtt gtcacagtaa atatcttgtt ttttctatgt acattgtaca 
tccttttgct ctttgtggtt ggatctaaca ctaactgtat tgttttgtta 
acatcttctg tggaaaaaaa aaaaaaaaaa aaa 

<210> 310 

<211> 2872 

<212> DNA 

<213> Homo sapiens 



catcatgatc 
aataatgctt 
taccagcagt 
tttttttggc 
attcttaaat 
agatatgcat 
aaatcatctg 
gttgtggaag 
ctgttggtat 
tagcaaaatt 
gtcgctatta 
tattttagga 
aatttttcat 
catcaaataa 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3933 



<400> 310 

tccaggaatc 

ggttgtattg 

accagcgact 

agcaacatga 

accatgacta 

gccttagggg 

gccatgaaca 

ggcatgggcg 

gcggccgcca 

agccgcgcgg 

ccgccctact 

ctcacgctga 

cagcagcgct 

gtggcacgct 

ggcaacatgt 

cagccggggg 

cctgagagcc 

cggggtgtgc 

agcccccaga 

actccagcct 

cccgcctctc 

gacccccact 

cagcagcata 

tctacgttgc 

gagccctcag 

acttcctagc 

aaaaaatcaa 

acaactattt 

tttattattg 

tttttcctgc 

ccccctctct 

aaaacaaaaa 

tacctgaccc 

tggtgaagaa 

ttatatagca 

ttctgataca 

ttaattatgc 

gtaatagata 

gaccccaacc 

cctcaagaac 

acacataaat 

ttacatgttg 

ttcaaaaatc 

agtgttatgc 

ctttgctttc 

gctttaaatg 



gatagtgcat 
ggcagggtgg 
ggaacagcta 
actcaggcct 
cgagcggcaa 
ccggcctgag 
gcatgactgc 
qcatgggtgc 
tgaacccgtg 
gcggcggcgg 
cgtacatctc 
gcgagatcta 
ggcagaactc 
ccccggacaa 
tcgagaacgg 
ccggcggcgg 
gcaaggaccc 
acgggaagac 
ctctggacca 
cctcaactgc 
acccggcaca 
actccttcaa 
agctggactt 
ccgccagcct 
ccctggagcc 
tcccgggact 
cagcaaacaa 
ttatttcatt 
tattcaaaat 
gaagtttaat 
ttcttccctc 
ggaagatggt 
cctacgagtt 
acaagtatac 
agacataaac 
tgcagatagc 
acatgctcag 
ggtgatatac 
cctttgctct 
tcgcttcttt 
tagtttctat 
ggaggaggag 
ctattgatag 
actttccaca 
tcaatgttaa 
atggttaaaa 



tcgtgcgcgc ggccgcccgt cgcttcgcac 
ctccaggatg ttaggaactg tgaagatgga 
ctacgcagac acgcaggagg cctactcctc 
gggctccatg aactccatga acacctacat 
catgaccccg gcgtccttca acatgtccta 
tcccggcgca gtagccggca tgccgggggg 
ggccggcgtg acggccatgg gtacggcgct 
gcagcaggcg gcctccatga tgaatggcct 
catgagcccc atggcgtacg cgccgtccaa 
cgacgccaag acgttcaagc gcagttaccc 
gctcatcacc atggccatcc agcgggcgcc 
ccagtggatc atggacctct tcccctatta 
catccgccac tcgctgtcct tcaatgactg 
gccgggcaag ggctcctact ggacgctgca 
ctgctacttg cgccgccaga agcgcttcaa 
gggcgggagc ggaagcgggg gcagcggcgc 
ctctggcgcc tctaacccca gcgccgactc 
cggccagcta gagggcgcgc cggccccggg 
cagtggggcg acggcgacag ggggcgcctc 
gccccccata agctccgggc ccggggcgct 
cggcttggca ccccacgagt cccagctgca 
ccacccgttG tccatcaaca acctcatgtc 
caaggcatac gaacaggcac tgcaatactc 
gcctctaggc agcgcctcgg tgaccaccag 
ggcgtactac caaggtgtgt attccagacc 
ggggggtttg tctggcatag ccatgctggt 
aaccacacaa accaaaccgt caacagcata 
tttcatgcac aaccttgccc ccagtgcaaa 
tcattgtgta tattactaca aagacggccc 
gatccacaag tgtatatatg aaattctcct 
ttggccctcc agacattcta gtttgtggag 
caagtttgta aaatatttgt ttgtgctttt 
tacaggcttg tggcaatact cttaaccata 
actagaggct cttaaaagta ttgaaaagac 
agattataaa catcagagcc atttgcttct 
agatgtcttt aaatgaaata catgtatatt 
atgtgtagac atcctccgta tatttacata 
gtgatacgtt ctcaagagtt gcttgaccga 
ctacccacag atggccctgg gaacaatcct 
gctttgagag tgccatggtc atgtcattct 
gagtgtatac catttaaaga ttttttcagt 
ataagttata gggagctgga tttcaaacgg 
tggccatttt aatcattgcc atcgtgtgct 
gttggtgtta gtatagccag agggtttcat 
tttattgcat ggtttattct ttttctttac 
ttacaaatta aattgggaat ttttatcaat 
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agggctggat 
agggcatgaa 
ggtcccggtc 
gaccatgaac 
tgccaacccg 
ctcggcgggc 
gagcccgagc 
gggcccctac 
cctgggccgc 
gcacgccaag 
cagcaagatg 
ccggcagaac 
cttcgtcaag 
cccggactcc 
gtgcgagaag 
caagggcggc 
gcccctccat 
cccggccgcc 
ggagttgaag 
ggcctctgtg 
cctgaaaggg 
ctcctcggag 
gccttacggc 
gagccccatc 
cgtcctaaac 
agcaagagag 
ataaaatcca 
agactgttac 
caaaccaatt 
ccttccttgc 
ggttatttaa 
cccccctcct 
agaattgaaa 
aatactgctg 
cagtttacat 
gtgtatggac 
acatatagag 
aagttacaag 
caggaattgc 
gaggtacata 
S-siagggaata 
tggtccaaga 
tgtttcatcc 
tattatttct 
agctgaaatt 
gtgattgtaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920. 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 



wo 2003/078662 



PCT/US2003/007713 



39740-000lPCT.txt 

ttaaaaatat tttgatttaa ataacaaaaa taataccaga ttttaagccg cggaaaatgt 2820 
tcttgatcat ttgcagttaa ggactttaaa taaatcaaat gttaacaaaa aa 2872 

<210> 311 
<211> 926 
<212> DNA 
<213> Homo sapiens 



<400> 311 

ggggcccatt 

ccaccgccgc 

gcgaggtggt 

gggcgcgcct 

acggctgtta 

gcaaggtgga 

actcggaatc 

gcaccctcaa 

atcgcatctt 

gggcaagagg 

agaacaagac 

gaggcactgg 

gggggtggga 

tcacgtttgg 
caccgactga 
aataatttag 



ctgtttcagc 
cgcgggcccc 
gcgctgtctg 
gcctgccctg 
ctcacgcctc 
gattctccag 
cgaagttggg 
cggcgagatc 
gtgtcgctga 
aattacgtgc 
cgatcggcgg 
cgaggagagg 
ttccactcgt 
tgcttctcag 
gaatattgtt 
atggtgaaaa 



cagtcgccaa 
agctgcgcgc 
tctgagcaga 
ctggacgagc 
aaggagctgg 
cacgtcatcg 
acccccgggg 
agcgccctga 
agcgcctccc 
tctgtgggtc 
ccactgcgcc 
gcgctcctct 
gtgtttctat 
atttctgagg 
ttacaatagt 
aaaaaa 



gaatcatgaa 
tgaaggccgg 
gcgtggccat 
agcaggtaaa 
tgcccaccct 
actacatcag 
gccgagggct 
cggccgaggc 
ccagggaccg 
tcccccaacg 
cttaactgca 
ctgcacacct 
tttttgaaaa 
aaattgcttt 
tctgtggggc 



agtcgccagt 
caagacagcg 
ctcgcgctgc 
cgtgctgctc 
gccccagaac 
ggaccttcag 
gccggtccgg 
ggcatgcgtt 
gcggacccca 
cgcctcgccg 
tccagcctgg 
actagtcacc 
gcagacattt 
gtattgtata 
tgtttttttg 



ggcagcaccg 60 
agcggtgcgg 120 
cggggcgccg 180 
tacgacatga 240 
cgcaaggtga 300 
ttggagctga 360 
gctccgctca 420 
cctgcggacg 480 
gccatccagg 540 
gatctgaggg 600 
ggctgaggct 660 
agagacttta 720 
taaaaaatgg 780 
ttacaatgat 840 
ttattaaaca 900 
926 



<210> 312 

<211> 4989 

<212> DNA 

<213> Homo sapiens 



<400> 312 

tttttttttt 

ggaggagggt 

tggccgacga 

ctgaagcgcc 

aaggccgagg 

ctgctgttcc 

atccgcggct 

aaggatattg 

aatgctgacc 

aataactaca 

atggaggaga 

tggaccacaa 

gagaacaatg 

acggcctgtg 

cccaacacct 

ctcagcgccg 

gagtgcccct 

ggtccttgcc 

tctgctcaga 

cgggggaata 

ggctacgtga 

cgcctcatcc 

cagaacttgc 

atgtactttg 

acggggacta 

gcctcctgtg 

atcataacct 

tactacaagg 

tccaacagct 

atcttactac 

ctcaccatgg 

accaatgctt 

cagttaatcg 

gtgcgctggc 

gacaaaatcc 

aaccccaaga 



ttttgagaaa 
ccccgacctc 
gtggagaaat 
tggagaactg 
actaccgcag 
gagtggctgg 
ggaaactctt 
ggctttacaa 
tctgttacct 
ttgtggggaa 
agccgatgtg 
accgctgcca 
agtgctgcca 
tagcttgccg 
acaggtttga 
agagcagcga 
cgggcttcat 
cgaaggtctg 
tgctccaagg 
acattgcttc 
agatccgcca 
taggagagga 
agcaactgtg 
ctttcaatcc 
aagggcgcca 
aaagtgacgt 
ggcaccggta 
aagcaccctt 
ggaacatggt 
atgggctgaa 
tggagaacga 
cagttccttc 
tgaagtggaa 
agcggcagcc 
ccatcaggaa 
ctgaggtgtg 



gggaatttca 
gctgtggggg 
ctgcgggcca 
cacggtgatc 
ctaccgcttc 
cctcgagagc 
ctacaactac 
cctgaggaac 
ctccactgtg 
taagccccca 
tgagaagacc 
gaaaatgtgc 
ccccgagtgc 
ccactactac 
gggctggcgc 
ctccgagggg 
ccgcaacggc 
tgaggaagaa 
atgcaccatc 
agagctggag 
ttctcatgcc 
gcagctagaa 
ggactgggac 
caaattatgt 
aagcaaaggg 
cctgcatttc 
ccggccccct 
taagaatgtc 
ggacgtggac 
gccctggact 
ccatatccgt 
cattcccttg 
ccctccctct 
tcaggacggc 
gtatgccgac 
tggtggggag 



tcccaaataa aaggaatgaa 
ctcctgtttc tctccgccgc 
ggcatcgaca tccgcaacga 
gagggctacc tccacatcct 
cccaagctca cggtcattac 
ctcggagacc tcttccccaa 
gccctggtca tcttcgagat 
attactcggg gggccatcag 
gactggtccc tgatcctgga 
aaggaatgtg gggacctgtg 
accatcaaca atgagtacaa 
ccaagcacgt gtgggaagcg 
ctgggcagct gcagcgcgcc 
tatgccggtg tctgtgtgcc 
tgtgtggacc gtgacttctg 
tttgtgatcc acgacggcga 
agccagagca tgtactgcat 
aagaaaacaa agaccattga 
ttcaagggca atttgctcat 
aacttcatgg ggctcatcga 
ttggtctcct tgtccttcct 
gggaattact ccttctacgt 
caccgcaacc tgaccatcaa 
gtttccgaaa tttaccgcat 
gacataaaca ccaggaacaa 
acctccacca ccacgtcgaa 
gactacaggg atctcatcag 
acagagtatg atgggcagga 
ctcccgccca acaaggacgt 
cagtacgccg tttacgtcaa 
ggggccaaga gtgagatctt 
gacgttcttt cagcatcgaa 
ctgcccaacg gcaacctgag 
tacctttacc ggcacaatta 
ggcaccatcg acattgagga 
aaagggcctt gctgcgcctg 
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gtctggctcc 
gctctcgctc 
ctatcagcag 
gctcatctcc 
cgagtacttg 
cctcacggtc 
gaccaatctc 
gattgagaaa 
tgcggtgtcc 
tccagggacc 
ctaccgctgc 
ggcgtgcacc 
tgacaacgac 
tgcctgcccg 
cgccaacatc 
gtgcatgcag 
cccttgtgaa 
ttctgttact 
taacatccga 
ggtggtgacg 
aaaaaacctt 
cctcgacaac 
stgcagggaaa 
ggaggaagtg 
cggggagaga 
gaatcgcatc 
cttcaccgtt 
tgcctgcggc 
ggagcccggc 
ggctgtgacc 
gtacattcgc 
ctcctcttct 
ttactacatt 
ctgctccaaa 
ggtcacagag 
ccccaaaact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 



wo 2003/078662 
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gaagccgaga 
ctgcacaact 
gccaacacca 
accgacccgg 
gagagaactg 
tgcaaccacg 
atgcccgcag 
aactccatct 
gaaataaaat 
aggaagtatg 
caggccacat 
gccaaaacag 
ttgatcgtgg 
aggctgggga 
gtgtacgttc 
gggcaggggt 
cctgaaacca 
gagtttctca 
ctgggtgtgg 
gatctcaaaa 
cctccaagcc 
ctcaacgcca 
gatttcacag 
taccggaaag 
gatggagtct 
gccacactgg 
atggagggcg 
cgcatgtgct 
atcaaagagg 
aagctgcccg 
gacccctcgg 
gagaacggcc 
tacgcccaca 
acctgctgat 
ggtggggggg 
cttcagttct 
atgttccttt 
tgggccttta 
tcctcactct 
ggaaaaataa 
tgtccctgtg 
aaaaacacgt 
ttacaaaggg 
aaatcctgaa 
tgagcatggc 
gactgcccct 
attattattt 
tgaaccggc 



agcaggccga 
ccatcttcgt 
ccatgtccag 
aagagctgga 
tcatttctaa 
aggctgagaa 
aaggagcaga 
ttttaaagtg 
acggatcaca 
gaggggccaa 
ct'Ctctctgg 
gatatgaaaa 
gagggttggt 
atggagtgct 
ctgatgagtg 
cgtttgggat 
gagtggccat 
acgaagcttc 
tgtcccaagg 
gttatctccg 
tgagcaagat 
ataagttcgt 
tcaaaatcgg 
gaggcaaagg 
tcaccactta 
ccgagcagcc 
gccttctgga 
ggcagtataa 
agatggagcc 
agccggagga 
cctcctcgtc 
ccggccctgg 
tgaacggggg 
ccttggatcc 
gagagagagt 
gcccttgctg 
tttcaatatg 
agaaccttaa 
gtccctgtcc 
ttgccacaag 
gccccatcca 
ggagatggaa 
ccatcgttca 
ctttctccct 
agctggttgc 
gctgtgctgc 
gggggaactg 



gaaggaggag 
gcccagacct 
ccgaagcagg 
gacagagtac 
ccttcggcct 
gctgggctgc 
tgacattcct 
gccggaacct 
agttgaggat 
gctaaaccgg 
gaatgggtcg 
cttcatccat 
gattatgctg 
gtatgcctct 
ggaggtggct 
ggtctatgaa 
taaaacagtg 
tgtgatgaag 
ccagccaaca 
gtctctgagg 
gattcagatg 
ccacagagac 
agattttggt 
gctgctgccc 
ctcggacgtc 
ctaccagggc 
caagccagac 
ccccaagatg 
tggcttccgg 
gctggacctg 
ctccctgcca 
ggtgctggtc 
ccgcaagaac 
tgaatctgtg 
tttaacaatc 
cccgcgggag 
caagcagctt 
tgacaacact 
ttccctgttc 
tccagctggg 
accactgtac 
atttttacct 
tccaaggctg 
catcggcccg 
tccatttgag 
tcaaggccac 
gacacaatag 



39740-OOOlPCT . txt 
gctgaatacc gcaaagtctt tgagaatttc 2220 
gaaaggaagc ggagagatgt catgcaagtg 2280 
aacaccacgg ccgcagacac ctacaacatc 2340 
cctttctttg agagcagagt ggataacaag 2400 
ttcacattgt accgcatcga tatccacagc 2460 
agcgcctcca acttcgtctt tgcaaggact .2520 
gggccagtga cctgggagcc aaggcctgaa 2580 
gagaatccca atggattgat tctaatgtat 2640 
cagcgagaat gtgtgtccag acaggaatac 2700 
ctaaacccgg ggaactacac agcccggatt 2760 
tggacagatc ctgtgttctt ctatgtccag 2820 
ctgatcatcg ctctgcccgt cgctgtcctg 2880 
tacgtcttcc atagaaagag aaataacagc 2940 
gtgaacccgg agtacttcag cgctgctgat 3000 
cgggagaaga tcaccatgag ccgggaactt 3060 
ggagttgcca agggtgtggt gaaagatgaa 3120 
aacgaggccg caagcatgcg tgagaggatt 3180 
gagttcaatt gtcaccatgt ggtgcgattg 3240 
ctggtcatca tggaactgat gacacggggc 3300 
ccagaaatgg agaataatcc agtcctagca 3360 
gccggagaga ttgcagacgg catggcatac 3420 
cttgctgccc ggaattgcat ggtagccgaa 3480 
atgacgcgag atatctatga gacagactat 3540 
gtgcgctgga tgtctcctga gtccctcaag 3600 
tggtccttcg gggtcgtcct ctgggagatc 3660 
ttgtccaacg agcaagtcct tcgcttcgtc 3720 
aactgtcctg acatgctgtt tgaactgatg 3780 
aggccttcct tcctggagat catcagcagc 3840 
gaggtctcct tctactacag cgaggagaac 3900 
gagccagaga acatggagag cgtccccctg 3960 
ctgcccgaca gacactcagg acacaaggcc 4020 
ctccgcgcca gcttcgacga gagacagcct 4080 
gagcgggcct tgccgctgcc ccagtcttcg 4140 
caaacagtaa cgtgtgcgca cgcgcagcgg 4200 
cattcacaag cctcctgtac ctcagtggat 4260 
acagcttctc tgcagtaaaa cacatttggg 4320 
tttattccct gcccaaaccc ttaactgaca 4380 
taatagcaac agagcacttg agaaccagtc 4440 
tccctttctc tctcctctct gcttcataac 4500 
aagccctttt tatcagtttg aggaagtggc 4560 
acacccgcct gacaccgtgg gtcattacaa 4620 
ttatctttca cctttctagg gacatgaaat 4680 
ttaccatttt aacgctgcct aattttgcca 4740 
gcgctgattc ctcgtgtccg gaggcatggg 4800 
agacacgctg gcgacacact ccgtccatcc 4860 
aggcacacag gtctcattgc ttctgactag 4920 
gtctttctct cagtgaaggt ggggagaagc 4980 
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ctaccgggcg 

gacgagcggt 

aatttgcttc. 

acttactgtt 

cgacggtccc 

tgacatccgt 

gcaggaggca 

tattgatgag 

caggtatgaa 

acgtgaacag 

gaaagctcaa 

tcaggtacat 

tgctcaggga 



gaggtgagcg 
ggttcgacaa 
tggccttccc 
tagaaaatgt 
cactttcccc 
atccagcttc 
atattacata 
cctgtacggc 
aatgaaagtc 
gagccagcac 
gattccaagg 
atcaagaatg 
acaactaatg 



cggcgccggc 
gtggccttgc 

ctacggatta 
ggcccacgag 
tgagcctcag 
ctgttgtgtc 
atttcagttc 
taaaacatgg 
ttcagaatgg 
gtcgtgtctc 
cctattcaaa 
tcaaagaaga 
ttcattcctc 



tcctcctgcg gcggactttg 
gggccggatc gtcccagtgg 
tacctggcct tcccctacgg 
acgcctggtt actatcaaaa 
cacctgcttg tttggaaggg 
aaaacaacat tgcaaaattg 
cacaaatcca acacaagtaa 
agatgtaata actattattg 
aaggaagtca actgaatttc 
aagatctagc ttctcttctg 
aatcactgaa ggaaaagttt 
cagtaccgca gatgactcaa 
agaacatgct ggacgtaatg 
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ggtgcgactt 60 
aagagttgta 120 
attatactca 180 
ggagcggggt 240 
gtattgaatg 300 
aaatccatga 360 
atgggtctgt 420 
atcgttcctt 480 
caagaaaaat 540 
accctgatga 600 
caggaaatcc 660 
aagacagtgt 720 
gcagaaatgc 780 
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agctgatccc atttctgggg attttaaaga aatttccagc gttaaattag tgagccgtta 840 
tggagaattg aagtctgttc ccactacaca atgtcttgac aatagcaaaa aaaatgaatc 900 
tcccttttgg aagctttatg agtcagtgaa gaaagagttg gatgtaaaat cacaaaaaga 960 
aaatgtccta cagtattgta gaaaatctgg attacaaact gattacgcaa cagagaaaga 1020 
aagtgctgat ggtttacagg gggagaccca actgttggtc tcgcgtaagt caagaccaaa 1080 
atctggtggg agcggccacg ctgtggcaga gcctgcttca cctgaacaag agcttgacca 1140 
gaacaagggg aagggaagag acgtggagtc tgttcagact cccagcaagg ctgtgggcgc 1200 
cagctttcct ctctatgagc cggctaaaat gaagacccct gtacaatatt cacagcaaca 1260 
aaattctcca caaaaacata agaacaaaga cctgtatact actggtagaa gagaatctgt 1320 
gaatctgggt aaaagtgaag gcttcaaggc tggtgataaa actcttactc ccaggaagct 1380 
ttcaactaga aiatcgaacac cagctaaagt tgaagatgca gctgactctg ccactaagcc 1440 
agaaaatctc tcttccaaaa ccagaggaag tattcctaca gatgtggaag ttctgcctac 1500 
ggaaactgaa attcacaatg agccattttt aactctgtgg ctcactcaag ttgagaggaa 1560 
gatccaaaag gattccctca gcaagcctga gaaattgggc actacagctg gacagatgtg 1620 
ctctgggtta cctggtctta gttcagttga tatcaacaac tttggtgatt ccattaatga 1680 
gagtgaggga atacctttga aaagaaggcg tgtgtccttt ggtgggcacc taagacctga 1740 
actatttgat gaaaacttgc ctcctaatac gcctctcaaa aggggagaag ccccaaccaa 1800 
aagaaagtct ctggtaatgc acactccacc tgtcctgaag aaaatcatca aggaacagcc 1860 
tcaaccatca ggaaaacaag agtcaggttc agaaatccat gtggaagtga aggcacaaag 1920 
cttggttata agccctccag ctcctagtcc taggaaaact ccagttgcca gtgatcaacg 1980 
ccgtaggtcc tgcaaaacag cccctgcttc cagcagcaaa tctcagacag aggttcctaa 2040 
gagaggagga gaaagagtgg caacctgcct tcaaaagaga gtgtctatca gccgaagtca 2100 
acatgatatt ttacagatga tatgttccaa aagaagaagt ggtgcttcgg aagcaaatct 2160 
gattgttgca aaatcatggg cagatgtagt aaaacttggt gcaaaacaaa cacaaactaa 2220 
agtcataaaa catggtcctc aaaggtcaat gaacaaaagg caaagaagac ctgctactcc 2280 
aaagaagcct gtgggcgaag ttcacagtca atttagtaca ggccacgcaa actctccttg 2340 
taccataata atagggaaag ctcatactga aaaagtacat gtgcctgctc gaccctacag 2400 
agtgctcaac aacttcattt ccaaccaaaa aatggacttt aaggaagatc tttcaggaat 2460 
agctgaaatg ttcaagaccc cagtgaagga gcaaccgcag ttgacaagca catgtcacat 2520 
cgctatttca aattcagaga atttgcttgg aaaacagttt caaggaactg attcaggaga 2580 
agaacctctg ctccccacct cagagagttt tggaggaaat gtgttcttca gtgcacagaa 2640 
tgcagcaaaa cagccatctg ataaatgctc tgcaagccct cccttaagac ggcagtgtat 2700 
tagagaaaat ggaaacgtag caaaaacgcc caggaacacc tacaaaatga cttctctgga 2760 
gacaaaaact tcagatactg agacagagcc ttcaaaaaca gtatccactg taaacaggtc 2820 
aggaaggtct acagagttca ggaatataca gaagctacct gtggaaagta agagtgaaga 2880 
aacaaataca gaaattgttg agtgcatcct aaaaagaggt cagaaggcaa cactactaca 2940 
acaaaggaga gaaggagaga tgaaggaaat agaaagacct tttgagacat ataaggaaaa 3000 
tattgaatta aaagaaaacg atgaaaagat gaaagcaatg aagagatcaa gaacttgggg 3060 
gcagaaatgt gcaccaatgt ctgacctgac agacctcaag agcttgcctg atacagaact 3120 
catgaaagac acggcacgtg gccagaatct cctccaaacc caagatcatg ccaaggcacc 3180 
aaagagtgag aaaggcaaaa tcactaaaat gccctgccag tcattacaac cagaaccaat 3240 
aaacacccca acacacacaa aacaacagtt gaaggcatcc ctggggaaag taggtgtgaa 3300 
agaagagctc ctagcagtcg gcaagttcac acggacgtca ggggagacca cgcacacgca 3360 
cagagagcca gcaggagatg gcaagagcat cagaacgttt aaggagtctc caaagcagat 3420 
cctggaccca gcagcccgtg taactggaat gaagaagtgg ccaagaacgc ctaaggaaga 3480 
ggcccagtca ctagaagacc tggctggctt caaagagctc ttccagacac caggtccctc 3540 
tgaggaatca atgactgatg agaaaactac caaaatagcc tgcaaatctc caccaccaga 3600 
atcagtggac actccaacaa gcacaaagca atggcctaag agaagtctca ggaaagcaga 3660 
tgtagaggaa gaattcttag cactcaggaa actaacacca tcagcaggga aagccatgct 3720 
tacgcccaaa ccagcaggag gtgatgagaa agacattaaa gcatttatgg gaactccagt 3780 
gcagaaactg gacctggcag gaactttacc tggcagcaaa agacagctac agactcctaa 3840 
ggaaaaggcc caggctctag aagacctggc tggctttaaa gagctcttcc agactcctgg 3900 
tcacaccgag gaattagtgg ctgctggtaa aaccactaaa ataccctgcg actctccaca 3960 
gtcagaccca gtggacaccc caacaagcac aaagcaacga cccaagagaa gtatcaggaa 4020 
agcagatgta gagggagaac tcttagcgtg caggaatcta atgccatcag caggcaaagc 4080 
catgcacacg cctaaaccat cagtaggtga agagaaagac atcatcatat ttgtgggaac 4140 
tccagtgcag aaactggacc tgacagagaa cttaaccggc agcaagagac ggccacaaac 4200 
tcctaaggaa gaggcccagg ctctggaaga cctgactggc tttaaagagc tcttccagac 4260 
ccctggtcat actgaagaag cagtggctgc tggcaaaact actaaaatgc cctgcgaatc 4320 
ttctccacca gaatcagcag acaccccaac aagcacaaga aggcagccca agacaccttt 4380 
ggagaaaagg gacgtacaga aggagctctc agccctgaag aagctcacac agacatcagg 4440 
ggaaaccaca cacacagata aagtaccagg aggtgaggat aaaagcatca acgcgtttag 4500 
ggaaactgca aaacagaaac tggacccagc agcaagtgta actggtagca agaggcaccc 4560 
aaaaactaag gaaaaggccc aacccctaga agacctggct ggctggaaag agctcttcca 4620 
gacaccagta tgcactgaca agcccacgac tcacgagaaa actaccaaaa tagcctgcag 4680 
atcacaacca gacccagtgg acacaccaac aagctccaag ccacagtcca agagaagtct 4740 
caggaaagtg gacgtagaag aagaattctt cgcactcagg aaacgaacac catcagcagg 4800 
caaagccatg cacacaccca aaccagcagt aagtggtgag aaaaacatct acgcatttat 4860 
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gggaactcca gtgcagaaac tggacctgac 
acaaactcct aaggaaaagg cccaggctct 
ccagacacga ggtcacactg aggaatcaat 
caaatcttca caaccagacc tagacaaaaa 
atccctgggg aaagtgggcg tgaaagaaga 
atcaggagag actacacaca cacacacaga 
atttatggag tctccaaagc agatcttaga 
gcagctgaga actcctaagg gaaagtctga 
gctcttccag acaccaagtc acactaagga 
atcctacaga gcttcacagc cagacctagt 
caagagaagt ctcaggaaag cagacactga 
gccatcagca ggcaaagcca tgcacacacc 
caacacgttt ttgggaactc cagtgcagaa 
caatagacgg ctacaaactc gtaaggaaaa 
cagagagctt ttccagacac catgcactga 
aaaaatactc tgcaaatctc cgcaatcaga 
acggcccaag agaagcctca agaaagcaga 
actaacacca tcagcaggca aagccatgca 
agacatcaac acatttgtgg ggactccagt 
tggcagcaag agacggccac aaactcctaa 
tggcttcaaa gagctcttcc agacaccagg 
aatcacagaa gtatcctgca aatctccaca 
caagcaacga ctcaagatat ccttggggaa 
cggcaagctc acacagacgt cagggaagac 
tggaaagagc atcaaagcgt ttaaggaatc 
tggaactggg atggagaggt ggccaagaac 
cctggccggc ttcaaagagc tcttccagac 
tgacaaaact accaaaatag cctgcaaatc 
aagcacaagg aggcggccca aaacaccttt 
agccctgaag cagctcacac agaccacaca 
aggcatcaac gtgttcaggg aaactgcaaa 
tggtagcaag aggcagccaa gaactcctaa 
tggcttgaaa gagctcttcc agacaccagt 
aactaccaaa atagcctgca gatctccaca 
caagccacag tccaagagaa gtctcaggaa 
caggaaacga acaccatcag tagggaaagc 
tgagaaagac atgaaagcat ttatgggaac 
tttacctggc agcaaaagat ggccacaaac 
cctggctggc ttcaaagagc tcttccagac 
gaaaactacc aaaatagcct gcaaatctcc 
cacaaagcaa cggcccaaga gaaacctcag 
actcaggaaa cgaacaccat cagcaggcaa 
tgatgagaaa aatatcaaca catttgtgga 
aaatttacct ggcagcaaga gacagccaca 
ggacctggtt ggcttcaaag aactcttcca 
tgatgacaaa atcacagaag tatcctgtaa 
aagaagctcc aagcaaaggc tcaagatacc 
cctagcagtc agcaagctca cacggacatc 
aacaggagat agtaagagca tcaaagcgtt 
agcagcaagt gtaactggta gcaggaggca 
tctagaagac ctggttgact tcaaagagct 
aatgactatt gacaaaaaca caaaaattcc 
cactgccacg agcacaaaga gatgccccaa 
gctctcagca gttgagaggc tcacgcaaac 
accagcaagc ggtgatgagg gcatcaaagt 
cccagtagaa gaggaaccca gcaggagaag 
cctggaagac ctggccggct tcacagagct 
actgactgct ggcaaagcca ctaaaatacc 
caccacagca agcacaaaga ggcatctcag 
agagccttca gcagtcaagt tcacacaaac 
accagcaggt gaagataaag gcatcaaagc 
tccagcagca agtgtaactg gcagcaggag 
agccatagaa gacctagctg gcttcaaaga 
gactgatgac aaaaccacta aaataccctg 
aacaagctca aagagacggc ccaggacacg 
gttagcagtt ggcaagctca cacaaacctc 
ggtaggtgag ggcaaaggca cgaaagcatt 
agaagatgta attggcagca ggagacagcc 
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agagaactta actggcagca agagacggct 4920 
agaagacctg gctggcttta aagagctctt 4980 
gactaacgat aaaactgcca aagtagcctg 5040 
cccagcaagc tccaagcgac ggctcaagac 5100 
gctcctagca gttggcaagc tcacacagac 5160 
gccaacagga gatggtaaga gcatgaaagc 5220 
ctcagcagca agtctiaactg gcagcaagag 5280 
agtccctgaa gacctggccg gcttcatcga 5340 
atcaatgact aatgaaaaaa ctaccaaagt 5400 
ggacacccca acaagctcca agccacagcc 5460 
agaagaattt ttagcattta ggaaacaaac 5520 
caaaccagca gtaggtgaag agaaagacat 5580 
actggaccag ccaggaaatt tacctggcag 5640 
ggcccaggct ctagaagaac tgactggctt 5700 
taaccccaca gctgatgaga aaactaccaa 5760 
cccagcggac accccaacaa acacaaagca 5820 
cgtagaggaa gaatttttag cattcaggaa 5880 
cacgcctaaa gcagcagtag gtgaagagaa 5940 
ggagaaactg gacctgctag gaaatttacc 6000 
agaaaaggcc aaggctctag aagatctggc 6060 
tcacactgag gaatcaatga ccgatgacaa 6120 
accagaccca gtcaaaaccc caacaagctc 6180 
agtaggtgtg aaagaagagg tcctaccagt 6240 
cacacagaca cacagagaga cagcaggaga 6300 
tgcaaagcag atgctggacc cagcaaacta 6360 
acctaaggaa gaggcccaat cactagaaga 6420 
accagaccac actgaggaat caacaactga 6480 
tccaccacca gaatcaatgg acactccaac 6540 
ggggaaaagg gatatagtgg aagagctctc 6600 
cacagacaaa gtaccaggag atgaggataa 6660 
acagaaactg gacccagcag caagtgtaac 6720 
gggaaaagcc caacccctag aagacttggc 6780 
atgcactgac aagcccacga ctcacgagaa 6840 
accagaccca gtgggtaccc caacaatctt 6900 
agcagacgta gaggaagaat ccttagcact 6960 
tatggacaca cccaaaccag caggaggtga 7020 
tccagtgcag aaattggacc tgccaggaaa 7080 
tcctaaggaa aaggcccagg ctctagaaga 7140 
accaggcact gacaagccca cgactgatga 7200 
acaaccagac ccagtggaca ccccagcaag 7260 
gaaagcagac gtagaggaag aatttttagc 7320 
agccatggac accccaaaac cagcagtaag 7380 
aactccagtg cagaaactgg acctgctagg 7440 
gactcctaag gaaaaggctg aggctctaga 7500 
gacaccaggt cacactgagg aatcaatgac 7560 
atctccacag ccagagtcat tcaaaacctc 7620 
cctggtgaaa gtggacatga aagaagagcc 7680 
aggggagact acgcaaacac acacagagcc 7740 
taaggagtct ccaaagcaga tcctggaccc 7800 
gctgagaact cgtaaggaaa aggcccgtgc 7860 
cttctcagca ccaggtcaca ctgaagagtc 7920 
ctgcaaatct cccccaccag aactaacaga 7980 
gacacgtccc aggaaagaag taaaagagga 8040 
atcagggcaa agcacacaca cacacaaaga 8100 
attgaagcaa cgtgcaaaga agaaaccaaa 8160 
gccaagagca cctaaggaaa aggcccaacc 8220 
ctctgaaaca tcaggtcaca ctcaggaatc 8280 
ctgcgaatct cccccactag aagtggtaga 8340 
gacacgtgtg cagaaggtac aagtaaaaga 8400 
atcaggggaa accacggatg cagacaaaga 8460 
attgaaggaa tctgcaaaac agacaccggc 8520 
acggccaaga gcacccaggg aaagtgccca 8580 
cccagcagca ggtcacactg aagaatcaat 8640 
caaatcatca ccagaactag aagacaccgc 8700 
tgcccagaaa gtagaagtga aggaggagct 8760 
aggggagacc acgcacaccg acaaagagcc 8820 
taagcaacct gcaaagcgga acgtggacgc 8880 
aagagcacct aaggaaaagg cccaacccct 8940 
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ggaagacctg gccagcttcc aagagctctc tcaaacacca ggccacactg aggaactggc 9000 
aaatggtgct gctgatagct ttacaagcgc tccaaagcaa acacctgaca gtggaaaacc 9060 
tctaaaaata tccagaagag ttcttcgggc ccctaaagta gaacccgtgg gagacgtggt 9120 
aagcaccaga gaccctgtaa aatcacaaag caaaagcaac acttccctgc ccccactgcc 9180 
cttcaagagg ggaggtggca aagatggaag cgtcacggga accaagaggc tgcgctgcat 9240 
gccagcacca gaggaaattg tggaggagct gccagccagc aagaagcaga gggttgctcc 9300 
cagggcaaga ggcaaatcat ccgaacccgt ggtcatcatg aagagaagtt tgaggacttc 9360 
tgcaaaaaga attgaacctg cggaagagct gaacagcaac gacatgaaaa ccaacaaaga 9420 
ggaacacaaa ttacaagact cggtccctga aaataaggga atatccctgc gctccagacg 9480 
ccaagataag actgaggcag aacagcaaat aactgaggtc tttgtattag cagaaagaat 9540 
agaaataaac agaaatgaaa agaagcccat gaagacctcc ccagagatgg acattcagaa 9600 
tccagatgat ggagcccgga aacccatacc tagagacaaa gtcactgaga acaaaaggtg 9660 
cttgaggtct gctagacaga atgagagctc ccagcctaag gtggcagagg agagcggagg 9720 
gcagaagagt gcgaaggttc tcatgcagaa tcagaaaggg aaaggagaag caggaaattc 9780 
agactccatg tgcctgagat caagaaagac aaaaagccag cctgcagcaa gcactttgga 9840 
gagcaaatct gtgcagagag taacgcggag tgtcaagagg tgtgcagaaa atccaaagaa 9900 
ggctgaggac aatgtgtgtg tcaagaaaat aacaaccaga agtcataggg acagtgaaga 9960 
tatttgacag aaaaatcgaa ctgggaaaaa tataataaag ttagttttgt gataagttct 10020 
agtgcagttt ttgtcataaa ttacaagtga attctgtaag taaggctgtc agtetgctta 10080 
agggaagaaa actttggatt tgctgggtct gaatcggctt cataaactcc actgggagca 10140 
ctgctgggct cctggactga gaatagttga acaccggggg ctttgtgaag gagtctgggc 10200 
caaggtttgc cctcagcttt gcagaatgaa gccttgaggt ctgtcaccac ccacagccac 10260 
cctacagcag ccttaactgt gacacttgcc acactgtgtc gtcgtttgtt tgcctatgtt 10320 
ctccagggca cggtggcagg aacaactatc ctcgtctgtc ccaacactga gcaggcactc 10380 
ggtaaacacg aatgaatgga taagcgcacg gatgaatgga gcttacaaga tctgtctttc 10440 
caatggccgg gggcatttgg tccccaaatt aaggctattg gacatctgca caggacagtc 10500 
ctatttttga tgtcctttcc tttctgaaaa taaagttttg tgctttggag aatgactcgt 10560 
gagcacatct ttagggacca agagtgactt tctgtaagga gtgactcgtg gcttgccttg 10620 
gtctcttggg aatacttttc taactagggt tgctctcacc tgagacattc tccacccgcg 10680 
gaatctcagg gtcccaggct gtgggccatc acgacctcaa actggctcct aatctccagc 10740 
tttcctgtca ttgaaagctt cggaagttta ctggctctgc tcccgcctgt tttctttctg 10800 
actctatctg gcagcccgat gccacccagt acaggaagtg acaccagtac tctgtaaagc 10860 
atcatcatcc ttggagagac tgagcactca gcaccttcag ccacgatttc aggatcgctt 10920 
ccttgtgagc cgctgcctcc gaaatctcct ttgaagccca gacatctttc tccagcttca 10980 
gacttgtaga tataactcgt tcatcttcat ttactttcca ctttgccccc tgtcctctct 11040 
gtgttcccca aatcagagaa tagcccgcca tcccccagat cacctgtctg gattcctccc 11100 
cattcaccca ccttgccagg tgcaggtgag gatggtgcac cagacagggt agctgtcccc 11160 
caaaatgtgc cctgtgcggg cagtgccctg tctccacgtt tgtttcccca gtgtctggcg 11220 
ggoagccagg tgacatcata aatacttgct gaatgaatgc agaaatcagc ggtactgact 11280 
tgtactatat tggctgccat gatagggttc tcacagcgtc atccatgatc gtaagggaga 11340 
atgacattct gcttgaggga gggaatagaa aggggcaggg aggggacatc tgagggcttc 11400 
acagggctgc aaagggtaca gggattgcac cagggcagaa caggggaggg tgttcaagga 11460 
agagtggctc ttagcagagg cactttggaa ggtgtgaggc ataaatgctt ccttctacgt 11520 
aggccaacct caaaactttc agtaggaatg ttgctatgat caagttgttc taacacttta 11580 
gacttagtag taattatgaa cctcacatag aaaaatttca tccagccata tgcctgtgga 11640 
gtggaatatt ctgtttagta gaaaaatcct ttagagttca gctctaacca gaaatcttgc 11700 
tgaagtatgt cagcaccttt tctcaccctg gtaagtacag tatttcaaga gcacgctaag 11760 
ggtggttttc attttacagg gctgttgatg atgggttaaa aatgttcatt taagggctac 11820 
ccccgtgttt aatagatgaa caccacttct acacaaccct ccttggtact gggggaggga 11880 
gagatctgac aaatactgcc cattccccta ggctgactgg atttgagaac aaatacccac 11940 
ccatttccac catggtatgg taacttctct gagcttcagt ttccaagtga atttccatgt 12000 
aataggacat tcccattaaa tacaagctgt ttttactttt tcgcctccca gggcctgtgc 12060 
gatctggtcc cccagcctct cttgggcttt cttacactaa ctctgtacct accatctcct 12120 
gcctccctta ggcaggcacc tccaaccacc acacactccc tgctgttttc cctgcctgga 12180 
actttcccac cagccccacc aagatcattt catccagtcc tgagctcagc ttaagggagg 12240 
cttcttgcct gtgggttccc tcacccccat gcctgtcctc caggctgggg caggttctta 12300 
gtttgcctgg aattgttctg tacctctttg tagcacgtag tgttgtgaaa ctaagccact 12360 
aattgagttt ctggctcccc tcctggggtt gtaagttttg ttcattcatg agggccgact 12420. 
gtatttcctg gttactgtat cccagtgacc agccacagga gatgtccaat aaagtatgtg 12480 
atgaaatggt cttaaaaaaa aaaaaaaaaa aaaaa 12515 
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<400> 314 

ggcacgaggc 

tcttcccggc 

gctccgcggg 

tcaagatcaa 

ctcctcaccg 

aggctgagct 

tgaagaagcg 

tccaagttcc 

tccaggtgac 

atcccatctt 

gaaactctgg 

aagaggacat 

aagctgatgt 

gcctgcaggc 

gtgagcccat 

aacgtaaaag 

ccaccgaccc 

tccccaagcc 

atgagtttcc 

cggcccctcc 

cagctctggc 

tggccccacc 

tgcagctgca 

ctgtgttcac 

gcatacctgt 

ctcgcctagt 

ggctccccaa 

tctcagccct 

ctggttgcag 

aactgccccc 

ttgtcagtat 

ctggaaaggg 

tctctgtagg 

agacagaagc 

atcatggatt 

ctggcattgt 

gtctctgccg 

ctctctccag 

tgccttaata 

caggaggcat 

aactaataaa 



39740-OOOlPCT . txt 



ggggccgggt 

agagcagccc 
cagcatccca 
tggctacaca 
gcctcacccc 
ctgcccggac 
ggacctggag 
tatagaagag 
agtgcgggac 
tgacaatcgt 
cagctgcctc 
tgaggtgtat 
gcaccgacaa 
tcctgtgcgt 
ggaattccag 
gacatatgag 
ccggcctcca 
agcaccccag 
caccatggtg 
ccaagtcctg 
ccaggcccca 
tgcccccaag 
gtttgatgat 
agacctggca 
ggccccccac 
gacagcccag 
tggcctcctt 
gctgagtcag 
gggattgaag 
aactttgtgg 
ctgtatctct 
gggagctggg 
gaactgtggg 
aggctggagg 
cattacagct 
ccctgtgcct 
gctctttcct 
gatccagaag 
gtagggtaag 
agtttttagt 
gctgttgcca 



cgcagctggg 
aagcagcggg 
ggcgagagga 
ggaccaggga 
cacgagcttg 
cgctgcatcc 
caggctatca 
cagcgtgggg 
ccatcaggca 
gcccccaaca 
ggtggggatg 

ttcacgggac 
gtggccattg 
gtctccatgc 
tacctgccag 
accttcaaga 
cctcgacgca 
ccctatccct 
tttccttctg 
ccccaggctc 
gcccctgtcc 
cccacccagg 
gaagacctgg 
tccgtcgaca 
acaactgagc 
aggccccccg 
tcaggagatg 
atcagctcct 
ccctccaaaa 
atgtcttcct 
ctctcttttt 
gaaactcaaa 
gtcccccatc 
taaggccttt 
taatcaaaat 
aacaccagcg 
tgctcaacca 
gggtttggtc 
ttgttaagag 
gaacaatcaa 
agctggacgg 



cccgcggcat 
gcatgcgctt 
gcacagatac 
cagtgcgcat 
taggaaagga 
acagtttcca 
gtcagcgcat 
actacgacct 
ggcccctccg 
ctgccgagct 
agatcttcct 
caggctggga 
tgttccggac 
agctgcggcg 
atacagacga 
gcatcatgaa 
ttgctgtgcc 
ttacgtcatc 
ggcagatcag 
cagccccngc 
cagtcctagc 
ctggggaagg 
gggccttgct 
actccgagtt 
ccatgctgat 
acccagctcc 
aagacttctc 
aagggggtga 
gcacttacgg 
tggagggggg 
ggaggtgctt 
cttttcccct 
cccatcctcc 
gagcccacaa 
aacgccccag 
tttgaggggc 
tggctgaagg 
tggacttcct 
tgggggagag 
agcacttgga 
cacgagctcg 



ggacgaactg 
ccgctacaag 
caccaagacc 
ctccctggtc 
ctgccgggat 
gaacctggga 
ccagaccaac 
gaatgctgtg 
cctgccgcct 
caagatctgc 
actgtgtgac 
ggcccgaggc 
ccctccctac 
gccttccgac 
tcgtcaccgg 
gaagagtcct 
ttcccgcagc 
cctgagcacc 
ccaggcctcg 
ccctgctcca 
cccaggccct 
aacgctgtca 
tggcaacagc 
tcagcagctg 
ggagtaccct 
tgctccactg 
ctccattgcg 
cgcctgccct 
attctggtgg 
agccatattt 
aagcagaagc 
gtcctgatgg 
agcttctggt 
agccttatca 
ataccagccc 
tgccttcctg 
aaacagtgca 
tgctctcccc 
caggctggca 
ctcttgctct 
tgcc 



ttccccctca 
tgcgaggggc 
caccccacca 
accaaggacc 
ggcttctatg 
atccagtgtg 
aacaacccct 
cggctctgct 
gtcctttctc 
cgagtgaacc 
aaggtgcaga 
tccttttcgc 
gcagacccca 
cgggagctca 
attgaggaga 
ttcagcggac 
tcagcttctg 
atcaactatg 
gccttggccc 
gccatggtat 
cctcaggctg 
gaggccctgc 
acagacccag 
ctgaaccagg 
gaggctataa 

ggggccccgg 

gacatggact 
ccccagagca 
ggtgtgttcc 
tattctttta 
attaacttct 
tcagctccct 
actctcctag 
agtgtcttcc 
ctgtatggca 
ccctacagag 
acagcactgg 
tcttctcaag 
gctctccagt 
ttctactctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2444 



<210> 315 
<211> 732 
<212> DNA 
<213> Homo sapiens 



<400> 315 

tgctgcgaac 

aaacctaagc 

ccagatgggg 

ttccgccttg 

gttgacctga 

gtagttgcca 

gagaccaacc 

ggcaggaaca 

tggtttcacc 

tgacaggagg 

ggaccaggct 

tcttggagct 

cttcctccca 



cacgtgggtc 
agctggaagg 
tccagcgggg 
ttggtctgaa 
aggaccgtcc 
tggtctggga 
ctgcagactc 
ttatacatgg 
ctgaggaact 
gcagaccaca 
gtaggaaatc 
gtgagttctc 

gc 



ccgggcgcgt 
aaccatggcc 
tcttgtggga 
attcatgcaa 
attctttgcc 
ggggctgaat 
caagcctggg 
cagtgattct 
ggtagattac 
ttgcttttca 
tagttattta 
cctgtacagt 



ttcgggtgct 
aactgtgagc 
gagattatca 
gcttccgaag 
ggcctggtga 
gtggtgaaga 
accatccgtg 
gtggagagtg 
acgagctgtg 
catccatttc 
caggaacttc 
gttaccatcc 



ggcggctgca 
gtaccttcat 
agcgttttga 
atcttctcaa 
aatacatgca 
cgggccgagt 
gagacttctg 
cagagaagga 
ctcagaactg 
ccctccttcc 
atcataattt 
ccgaccatct 



gccggagttc 60 
tgcgatcaaa 120 
gcagaaagga 180 
ggaacactac 240 
ctcagggccg 300 
catgctcggg 360 
catacaagtt 420 
gatcggcttg 480 
gatctatgaa 540 
catgggcaga 600 
ggagggaagc 660 
gattaaaatg 720 
732 



<210> 316 . 

<211> 2422 

<212> DNA 

<213> Homo sapi ens 
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<400> 316 

gtcagcctcc cttccaccgc catattgggc cactaaaaaa agggggctcg tcttttcggg 60 
gtgtttttct ccccctcccc tgtccccgct tgctcacggc tctgcgactc cgacgccggc 120 
aaggtttgga gagcggctgg gttcgcggga cccgcgggct tgcacccgcc cagactcgga 180 
cgggctttgc caccctctcc gcttgcctgg tcccctctcc tctccgccct cccgctcgcc 240 
agtccatttg atcagcggag actcggcggc cgggccgggg cttccccgca gcccctgcgc 300 
gctcctagag ctcgggccgt ggctcgtcgg ggtctgtgtc ttttggctcc gagggcagtc 360 
gctgggcttc cgagaggggt tcgggccgcg taggggcgct ttgttttgtt cggttttgtt 420 
tttttgagag tgcgagagag gcggtcgtgc agacccggga gaaagatgtc aaacgtgcga 480 
gtgtctaacg ggagccctag cctggagcgg atggacgcca ggcaggcgga gcaccccaag 540 
ccctcggcct gcaggaacct cttcggcccg gtggaccacg aagagttaac ccgggacttg 600 
gagaagcact gcagagacat ggaagaggcg agccagcgca agtggaattt cgattttcag 660 
aatcacaaac ccctagaggg caagtacgag tggcaagagg tggagaaggg cagcttgccc 720 
gagttctact acagaccccc gcggcccccc aaaggtgcct gcaaggtgcc ggcgcaggag 780 
agccaggatg tcagcgggag ccgcccggcg gcgcctttaa ttggggctcc ggctaactct 840 
gaggacacgc atttggtgga cccaaagact gatccgtcgg acagccagac ggggttagcg 900 
gagcaatgcg caggaataag gaagcgacct gcaaccgacg attcttctac tcaaaacaaa 960 
agagccaaca gaacagaaga aaatgtttca gacggttccc caaatgccgg ttctgtggag 1020 
cagacgccca agaagcctgg cctcagaaga cgtcaaacgt aaacagctcg aattaagaat 1080 
atgtttcctt gtttatcaga tacatcactg cttgatgaag caaggaagat atacatgaaa 1140 
attttaaaaa tacatatcgc tgacttcatg gaatggacat cctgtataag cactgaaaaa 1200 
caacaacaca ataacactaa aattttaggc actcttaaat gatctgcctc taaaagcgtt 1260 
ggatgtagca ttatgcaatt aggtttttcc ttatttgctt cattgtacta cctgtgtata 1320 
tagtttttac cttttatgta gcacataaac tttggggaag ggagggcagg gtggggctga 1380 
ggaactgacg tggagcgggg tatgaagagc ttgctttgat ttacagcaag tagataaata 1440 
tttgacttgc atgaagagaa gcaattttgg ggaagggttt gaattgtttt ctttaaagat 1500 
gtaatgtccc tttcagagac agctgatact tcatttaaaa aaatcacaaa aatttgaaca 1560 
ctggctaaag ataattgcta tttattttta caagaagttt attctcattt gggagatctg 1620 
gtgatctccc aagctatcta aagtttgtta gatagctgca tgtggctttt ttaaaaaagc 1680 
aacagaaacc tatcctcact gccctcccca gtctctctta aagttggaat ttaccagtta 1740 
attactcagc agaatggtga tcactccagg tagtttgggg caaaaatccg aggtgcttgg 1800 
gagttttgaa tgttaagaat tgaccatctg cttttattaa atttgttgac aaaattttct 1860 
cattttcttt tcacttcggg ctgtgtaaac acagtcaaaa taattctaaa tccctcgata 1920 
tttttaaaga tetgtaagta acttcacatt aaaaaatgaa atatttttta atttaaagct 1980 
tactctgtcc atttatccac aggaaagtgt tatttttaaa ggaaggttca tgtagagaaa 2040 
agcacacttg taggataagt gaaatggata ctacatcttt aaacagtatt tcattgcctg 2100 
tgtatggaaa aaccatttga agtgtacctg tgtacataac tctgtaaaaa cactgaaaaa 2160 
ttatactaac ttatttatgt taaaagattt tttttaatct agacaatata caagccaaag 2220 
tggcatgttt tgtgcatttg taaatgctgt gttgggtaga ataggttttc ccctcttttg 2280 
ttaaataata tggctatgct taaaaggttg catactgagc caagtataat tttttgtaat 2340 
gtgtgaaaaa gatgccaatt attgttacac attaagtaat caataaagaa aacttccata 2400 
gctaaaaaaa aaaaaaaaaa aa 2422 

<210> 317 

<211> 5061 

<212> DNA 

<213> Homo sapiens 

<400> 317 

atggctcaga tatttagcaa cagcggattt aaagaatgtc cattttcaca tccggaacca 60 
acaagagcaa aagatgtgga caaagaagaa gcattacaga tggaagcaga ggctttagca 120 
aaactgcaaa aggatagaca agtgactgac aatcagagag gctttgagtt gtcaagcagc 180 
accagaaaaa aagcacaggt ttataacaag caggattatg atctcatggt gtttcctgaa 240 
tcagattccc aaaaaagagc attagatatt gatgtagaaa agctcaccca agctgaactt 300 
gagaaactat tgctggatga cagtttcgag actaaaaaaa cacctgtatt accagttact 360 
cctattctga gcccttcctt ttcagcacag ctctatttta gacctactat tcagagagga 420 
cagtggccac ctggattacc tgggccttcc acttatgctt taccttctat ttatccttct 480 
acttacagta aacaggctgc attccaaaat ggcttcaatc caagaatgcc cacttttcca 540 
tctacagaac ctatatattt aagtcttccg ggacaatctc catatttctc atatcctttg 600 
acacctgcca caccctttca tccacaagga agcttaccta tctatcgtcc agtagtcagt 660 
actgacatgg caaaactatt tgacaaaata gctagtacat cagaattttt aaaaaatggg 720 
aaagcaagga ctgatttgga gataacagat tcaaaagtca gcaatctaca ggtatctcca 780 
aagtctgagg atatcagtaa atttgactgg ttagacttgg atcctctaag taagcctaag 840 
gtggataatg tggaggtatt agaccatgag gaagagaaaa atgtttcaag tttgctagca 900 
aaggatcctt gggatgctgt tcttcttgaa gagagatcga cagcaaattg tcatcttgaa 960 
agaaaggtga atggaaaatc cctttctgtg gcaactgtta caagaagcca gtctttaaat 1020 
attcgaacaa ctcagcttgc aaaagcccag ggccatatat ctcagaaaga cccaaatggg 1080 
accagtagtt tgccaactgg aagttctctt cttcaagaag ttgaagtaca gaatgaggag 1140 
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atggcagctt tttgtcgatc cantacaaaa ttgaagacca aatttccata taccaatcac 1200 
cgcacaaacc caggctattt gttaagtcca gtcacagcgc aaagaaacat atgcggagaa 1260 
aatgctagtg tgaaggtctc cattgacatt gaaggatttc agctaccagt tacttttacg 1320 
tgtgatgtga gttctactgt agaaatcatt ataatgcaag ccctttgctg ggtacatgat 1380 
gacttgaatc aagtagatgt tggcagctat gttctaaaag tttgtggtca agaggaagtg 1440 
ctgcagaata atcattgcct tggaagtcat gagcatiattc aaaactgtcg aaaatgggac 1500 
acagaaatta gactacaact cttgaccttc agtgcaatgt gtcaaaatct ggcccgaaca 1560 
gcagaagatg atgaaacacc cgtggattta aacaaacacc tgtatcaaat agaaaaacct 1620 
tgcaaagaag ccatgacgag acaccctgtt gaagaactct tagattctta tcacaaccaa 1680 
gtagaactgg ctcttcaaat tgaaaaccaa caccgagcag tagatcaagt aattaaagct 1740 
gtaagaaaaa tctgtagtgc tttagatggt gtcgagactc ttgccattac agaatcagta 1800 
aagaagctaa agagagcagt taatcttcca aggagtaaaa ctgctgatgt gacttctttg 1860 
tttggaggag aagacactag caggagttca actaggggct cacttaatcc tgaaaatcct 1920 
gttcaagtaa gcataaacca attaactgca gcaatttatg atcttctcag actccatgca 1980 
aattctggta ggagtcctac agactgtgcc caaagtagca agagtgtcaa ggaagcatgg 2040 
actacaacag agcagctcca gtttactatt tttgctgctc atggaatttc aagtaattgg 2100 
gtatcaaatt atgaaaaata ctacttgata tgttcactgt ctcacaatgg aaaggatctt 2160 
tttaaaccta ttcaatcaaa gaaggttggc acttacaaga atttcttcta tcttattaaa 2220 
tgggatgaac taatcatttt tcctatccag atatcacaat tgccattaga atcagttctt 2280 
caccttactc tttttggaat tttaaatcag agcagtggaa gttcccctga ttctaataag 2340 
cagagaaagg gaccagaagc tttgggcaaa gtttctttac ctctttgtga ctttagacgg 2400 
tttttaacat gtggaactaa acttctatat ctttggactt catcacatac aaattctgtt 2460 
cctggaacag ttaccaaaaa aggatatgtc atggaaagaa tagtgctaca ggttgatttt 2520 
ccttctcctg catttgatat tatttataca actcctcaag ttgacagaag cattatacag 2580 
caacataact tagaaacact agagaatgat ataaaaggga aacttcttga tattcttcat 2640 
aaagactcat cacttggact ttctaaagaa gataaagctt ttttatggga gaaacgttat 2700 
tattgcttca aacacccaaa ttgtcttcct aaaatattag caagcgcccc aaactggaaa 2760 
tggggtaatc ttgccaaaac ttactcattg cttcaccagt ggcctgcatt gtacccacta 2820 
attgcattgg aacttcttga ttcaaaattt gctgatcagg aagtaagatc cctagctgtg 2880 
acctggattg aggccattag tgatgatgag ctaacagatc ttcttccaca gtttgtacaa 2940 
gctttgaaat atgaaattta cttgaatagt tcattagtgc aattcctttt gtccagggca 3000 
ttgggaaata tccagatagc acacaattta tattggcttc tcaaagatgc cctgcatgat 3060 
gtacagttta gtacccgata cgaacatgtt ttgggtgctc tcctgtcagt aggaggaaaa 3120 
cgacttagag aagaacttct aaaacagacg aaacttgtac agcttttagg aggagtagca 3180 
gaaaaagtaa ggcaggctag tggatcagcc agacaggttg ttctccaaag aagtatggaa 3240 
cgagtacagt ccttttttca gaaaaataaa tgccgtctcc ctctcaagcc aagtctagtg 3300 
gcaaaagaat taaatattaa gtcgtgttcc ttcttcagtt ctaatgctgt ccccctaaaa 3360 
gtcacaatgg tgaatgctga ccctctggga gaagaaatta atgtcatgtt taaggttggt 3420 
gaagatcttc ggcaagatat gttagcttta cagatgataa agattatgga taagatctgg 3480 
cttaaagaag gactagatct gaggatggta attttcaaat gtctctcaac tggcagagat 3540 
cgaggcatgg tggagctggt tcctgcttcc gataccctca ggaaaatcca agtggaatat 3600 
ggtgtgacag gatcctttaa agataaacca cttgcagagt ggctaaggaa atacaatccc 3660 
tctgaagaag aatatgaaaa ggcttcagag aactttatct attcctgtgc tggatgctgt 3720 
gtagccacct atgttttagg catctgtgat cgacacaatg acaatataat gcttcgaagc 3780 
acgggacaca tgtttcacat tgactttgga aagtttttgg gacatgcaca gatgtttggc 3840 
agcttcaaaa gggatcgggc tccttttgtg ctgacctctg atatggcata tgtcattaat 3900 
gggggtgaaa agcccaccat tcgttttcag ttgtttgtgg acctctgctg tcaggcctac 3960 
aacttgataa gaaagcagac aaaccttttt cttaacctcc tttcactgat gattccttca 4020 
gggttaccag aacttacaag tattcaagat ttgaaatacg ttagagatgc acttcaaccc 4080 
caaactacag acgcagaagc tacaattttc tttactaggc ttattgaatc aagtttggga 4140 
agcattgcca caaagtttaa cttcttcatt cacaaccttg ctcagcttcg tttttctggt 4200 
cttccttcta atgatgagcc catcctttca ttttcaccta aaacatactc ctttagacaa 4260 
gatggtcgaa tcaaggaagt ctctgttttt acatatcata agaaatacaa cccagataaa 4320 
cattatattt atgtagtccg aattttgtgg gaaggacaga ttgaaccatc atttgtcttc 4380 
cgaacatttg tcgaatttca ggaacttcac aataagctca gtattatttt tccactttgg 4440 
aagttaccag gctttcctaa taggatggtt ctaggaagaa cacacataaa agatgtagca 4500 
gccaaaagga aaattgagtt aaacagttac ttacagagtt tgatgaatgc ttcaacggat 4560 
gtagcagagt gtgatcttgt ttgtactttc ttccaccctt tacttcgtga tgagaaagct 4620 
gaagggatag ctaggtctgc agatgcaggt tccttcagtc ctactccagg ccaaatagga 4680 
ggagctgtga aattatccat ctcttaccga aatggtactc ttttcatcat ggtgatgcat 4740 
atcaaagatc ttgttactga agatggagct gacccaaatc catatgtcaa aacataccta 4800 
cttccagata accacaaaac atccaaacgt aaaaccaaaa tttcacgaaa aacgaggaat 4860 
ccgacattca atgaaatgct tgtatacagt ggatatagca aagaaaccct aagacagcga 4920 
gaacttcaac taagtgtact cagtgcagaa tctctgcggg agaatttttt cttgggtgga 4980 
gtaaccctgc ctttgaaaga tttcaacttg agcaaagaga cggttaaatg gtatcagctg 5040 
actgcggcaa catacttgta a . 5061 



Page 65 



wo 2003/078662 



PCT/US2003/007713 



39740-000lPCT.txt 

<210> 318 

<211> 3014 

<212> DNA 

<213> Homo sapiens 

<400> 318 

ctgaccagcg ccgccctccc ccgcccccga cccaggaggt ggagatccct ccggtccagc 60 
cacattcaac acccactttc tcctccctct gcccctatat tcccgaaacc ccctcctcct 120 
tcccttttcc ctcctccctg gagacggggg aggagaaaag gggagtccag tcgtcatgac 180 
tgagctgaag gcaaagggtc cccgggctcc ccacgtggcg ggcggcccgc cctcccccga 240 
99"tcggatcc ccactgctgt gtcgcccagc cgcaggtccg ttcccgggga gccagacctc 300 
ggacaccttg cctgaagttt cggccatacc tatctccctg gacgggctac tcttccctcg 360 
gccctgccag ggacaggacc cctccgacga aaagacgcag gaccagcagt cgctgtcgga 420 
cgtggagggc gcatattcca gagctgaagc tacaaggggt gctggaggca gcagttctag 480 
tcccccagaa aaggacagcg gactgctgga cagtgtcttg gacactctgt tggcgccctc 540 
aggtcccggg cagagccaac ccagccctcc cgcctgcgag gtcaccagct cttggtgcct 600 
gtttggcccc gaacttcccg aagatccacc ggctgccccc gccacccagc gggtgttgtc 660 
cccgctcatg agccggtccg ggtgcaaggt tggagacagc tccgggacgg cagctgccca 720 
taaagtgctg ccccggggcc tgtcaccagc ccggcagctg ctgctcccgg cctctgagag 780 
ccctcactgg tccggggccc cagtgaagcc gtctccgcag gccgctgcgg tggaggttga 840 
ggaggaggat ggctctgagt ccgaggagtc tgcgggtccg cttctgaagg gcaaacctcg 900 
ggctctgggt ggcgcggcgg ctggaggagg agccgcggct gtcccgccgg gggcggcagc 960 
sggaggcgtc gccctggtcc ccaaggaaga ttcccgcttc tcagcgccca gggtcgccct 1020 
ggtggagcag gacgcgccga tggcgcccgg gcgctccccg ctggccacca cggtgatgga 1080 
tttcatccac gtgcctatcc tgcctctcaa tcacgcctta ttggcagccc gcactcggca 1140 
gctgctggaa gacgaaagtt acgacggcgg ggccggggct gccagcgcct ttgccccgcc 1200 
gcggagttca ccctgtgcct cgtccacccc ggtcgctgta ggcgacttcc ccgactgcgc 1260 
gtacccgccc gacgccgagc ccaaggacga cgcgtaccct ctctatagcg acttccagcc 1320 
gcccgctcta aagataaagg aggaggagga aggcgcggag gcctccgcgc gctccccgcg 1380 
ttcctacctt gtggccggtg ccaaccccgc agccttcccg gatttcccgt tggggccacc 1440 
gcccccgctg ccgccgcgag cgaccccatc cagacccggg gaagcggcgg tgacggccgc 1500 
acccgccagt gcctcagtct cgtctgcgtc ctcctcgggg tcgaccctgg agtgcatcct 1560 
gtacaaagcg gagggcgcgc cgccccagca gggcccgttc gcgccgccgc cctgcaaggc 1620 
gccgggcgcg agcggctgcc tgctcccgcg ggacggcctg ccctccacct ccgcctctgc 1680 
cgccgccgcc ggggcggccc ccgcgctcta ccctgcactc ggcctcaacg ggctcccgca 1740 
gctcggctac caggccgccg tgctcaagga gggcctgccg caggtctacc cgccctatct 1800 
caactacctg aggccggatt cagaagccag ccagagccca caatacagct tcgagtcatt 1860 
acctcagaag atttgtttaa tctgtgggga tgaagcatca ggctgtcatt atggtgtcct 1920 
tacctgtggg agctgtaagg tcttctttaa gagggcaatg gaagggcagc acaactactt 1980 
atgtgctgga agaaatgact gcatcgttga taaaatccgc agaaaaaact gcccagcatg 2040 
tcgccttaga aagtgctgtc aggctggcat ggtccttgga ggtcgaaaat ttaaaaagtt 2100 
caataaagtc agagttgtga gagcactgga tgctgttgct ctcccacagc cagtgggcgt 2160 
tccaaatgaa agccaagccc taagccagag attcactttt tcaccaggtc aagacataca 2220 
gttgattcca ccactgatca acctgttaat gagcattgaa ccagatgtga tctatgcagg 2280 
acatgacaac acaaaacctg acacctccag ttctttgctg acaagtctta atcaactagg 2340 
cgagaggcaa cttctttcag tagtcaagtg gtctaaatca ttgccaggtt ttcgaaactt 2400 
acatattgat gaccagataa ctctcattca gtattcttgg atgagcttaa tggtgtttgg 2460 
tctaggatgg agatcctaca aacacgtcag tgggcagatg ctgtattttg cacctgatct 2520 
aatactaaat gaacagcgga tgaaagaatc atcattctat tcattatgcc ttaccatgtg 2580 
gcagatccca caggagtttg tcaagcttca agttagccaa gaagagttcc tctgtatgaa 2640 
agtattgtta cttcttaata caattccttt ggaagggcta cgaagtcaaa cccagtttga 2700 
ggagatgagg tcaagctaca ttagagagct catcaaggca attggtttga ggcaaaaagg 2760 
agttgtgtcg agctcacagc gtttctatca acttacaaaa cttcttgata acttgcatga 2820 
tcttgtcaaa caacttcatc tgtactgctt gaatacattt atccagtccc gggcactgag 2880 
tgttgaattt ccagaaatga tgtctgaagt tattgctgca caattaccca agatattggc 2940 
agggatggtg aaaccccttc tctttcataa aaagtgaatg tcatcttttt cttttaaaga 3000 
attaaatttt gtgg 3014 

<210> 319 
<211> 2148 
<212> DNA 

<213> Homo sapiens 
<400> 319 

gcttcagggt acagctcccc cgcagccaga agccgggcct gcagcgcctc agcaccgctc 60 
cgggacaccc cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc ggtgtggtga 120 
actctctgag gaaaaaccat tttgattatt actctcagac gtgcgtggca acaagtgact 180 
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gagacctaga aatccaagcg ttggaggtcc tgaggccagc ctaagtcgct tcaaaatgga 240 
acgaaggcgt ttgtggggtt ccattcagag ccgatacatc agcatgagtg tgtggacaag 300 
cccacggaga cttgtggagc tggcagggca gagcctgctg aaggatgagg ccctggccat 360 
tgccgccctg gagttgctgc ccagggagct cttcccgcca ctcttcatgg cagcctttga 420 
cgggagacac agccagaccc tgaaggcaat ggtgcaggcc tggcccttca cctgcctccc 480 
tctgggagtg ctgatgaagg gacaacatct tcacctggag accttcaaag ctgtgcttga 540 
tggacttgat gtgctccttg cccaggaggt tcgccccagg aggtggaaac ttcaagtgct 600 
ggatttacgg aagaactctc atcaggactt ctggactgta tggtctggaa acagggccag 660 
tctgtactca tttccagagc cagaagcagc tcagcccatg acaaagaagc gaaaagtaga 720 
tggtttgagc acagaggcag agcagccctt cattccagta gaggtgctcg tagacctgtt 780 
cctcaaggaa ggtgcctgtg atgaattgtt .ctcctacctc attgagaaag tgaagcgaaa 840 
gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatt tttgcaatgc ccatgcagga 900 
tatcaagatg atcctgaaaa tggtgcagct ggactctatt gaagatttgg aagtgacttg 960 
tacctggaag ctacccacct tggcgaaatt ttctccttac ctgggccaga tgattaatct 1020 
gcgtagactc ctcctctccc acatccatgc atcttcctac atttccccgg agaaggaaga 1080 
gcagtatatc gcccagttca cctctcagtt cctcagtctg cagtgcctgc aggctctcta 1140 
tgtggactct ttatttttcc ttagaggccg cctggatcag ttgctcaggc acgtgatgaa 1200 
ccccttggaa accctctcaa taactaactg ccggctttcg gaaggggatg tgatgcatct 1260 
gtcccagagt cccagcgtea gtcagctaag tgtcctgagt ctaagtgggg tcatgctgac 1320 
cgatgtaagt cccgagcccc tccaagctct gctggagaga gcctctgcca ccctccagga 1380 
cctggtcttt gatgagtgtg ggatcacgga tgatcagctc cttgccctcc tgccttccct 1440 
gagccactgc tcccagctta caaccttaag cttctacggg aattccatct ccatatctgc 1500 
cttgcagagt ctcctgcagc acctcatcgg gctgagcaat ctgacccacg tgctgtatcc 1560 
tgtccccctg gagagttatg aggacatcca tggtaccctc cacctggaga ggcttgccta 1620 
tctgcatgcc aggctcaggg agttgctgtg tgagttgggg cggcccagca tggtctggct 1680 
tagtgccaac ccctgtcctc actgtgggga cagaaccttc tatgacccgg agcccatcct 1740 
gtgcccctgt ttcatgccta actagctggg tgcacatatc aaatgcttca ttctgcatac 1800 
ttggacacta aagccaggat gtgcatgcat cttgaagcaa caaagcagcc acagtttcag 1860 
acaaatgttc agtgtgagtg aggaaaacat gttcagtgag gaaaaaacat tcagacaaat 1920 
gttcagtgag gaaaaaaagg ggaagttggg gataggcaga tgttgacttg aggagttaat 1980 
gtgatctttg gggagataca tcttatagag ttagaaatag aatctgaatt tctaaaggga 2040 
gattctggct tgggaagtac atgtaggagt taatccctgt gtagactgtt gtaaagaaac 2100 
tgttgaaaat aaagagaagc aatgtgaagc aaaaaaaaaa aaaaaaaa 2148 

<210> 320 
<211> 540 
<212> DNA 
<213> Homo sapiens 

<400> 320 

atccctgact cggggtcgcc tttggagcag agaggaggca atggccacca tggagaacaa 60 
ggtgatctgc gccctggtcc tggtgtccat gctggccctc ggcaccctgg ccgaggccca 120 
gacagagacg tgtacagtgg ccccccgtga aagacagaat tgtggttttc ctggtgtcac 180 
gccctcccag tgtgcaaata agggctgctg tttcgacgac accgttcgtg gggtcccctg 240 
gtgcttctat cctaatacca tcgacgtccc tccagaagag gagtgtgaat tttagacact 300 
tctgcaggga tctgcctgca tcctgacggg gtgccgtccc cagcacggtg attagtccca 360 
gagctcggct gccacctcca ccggacacct cagacacgct tctgcagctg tgcctcggct 420 
cacaacacag attgactgct ctgactttga ctactcaaaa ttggcctaaa aattaaaaga 480 
gatcgatatt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 540 



<210> 321 

<211> 2346 . 
<212> DNA 

<213> Homo sapiens < 
<400> 321 

gcacgaggct gcggcgggtc cgggcccatg aggcgacgaa ggaggcggga cggcttttac 60 
ccagccccgg acttccgaga cagggaagct gaggacatgg caggagtgtt tgacatagac 120 
ctggaccagc cagaggacgc gggctctgag gatgagctgg aggagggggg tcagttaaat 180 
gaaagcatgg accatggggg agttggacca tatgaacttg gcatggaaca ttgtgagaaa 240 
tttgaaatct cagaaactag tgtgaacaga gggccagaaa aaatcagacc agaatgtttt 300 
gagctacttc gggtacttgg taaagggggc tatggaaagg tttttcaagt acgaaaagta 360 
acaggagcaa atactgggaa aatatttgcc atgaaggtgc ttaaaaaggc aatgatagta 420 
agaaatgcta aagatacagc tcatacaaaa gcagaacgga atattctgga ggaagtaaag 480 
catcccttca tcgtggattt aatttatgcc tttcagactg gtggaaaact ctacctcatc 540 
cttgagtatc tcagtggagg agaactattt atgcagttag aaagagaggg aatatttatg 600 
gaagacactg cctgctttta cttggcagaa atctccatgg ctttggggca tttacatcaa 660 
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gtgaaactaa 
acattttgtg 
cgtgctgtgg 
ccattcactg 
ttgcctccct 
gctgcttctc 
tttagacaca 
ctgttgcaat 
gtcgacagcc 
acatatgtgg 
atccgatcac 
tctcctgggg 
gtggaatacc 
gcatcggcac 
cccatgatct 
aatgaattta 
agactcaaaa 
aaaaataaac 
tagtattgct 
cttatcaagg 
tctgaatcac 
ttgcctataa 
aggcaaaatg 
acctgaatct 
tatgaaaaac 
caacttctgt 
ctgtggctcg 
gagcct 



tctacagaga 
cagactttgg 
gaacaataga 
attggtggag 
gggagaatag 
acctcacaca 
gtctgggagc 
ttaactggga 
ctgaagagga 
cagatgactc 
ctccatctgt 
ctcgaagatt 
atttctgggg 
caatggaaac 
cacttccaat 
ccaaacggcc 
aggcaaaaag 
tgacagtttc 
gtggatttta 
gaactcttag 
attttcatgt 
tgtgagtctg 
tacttgcaac 
ggaaggcaaa 
tttttttata 
atcccaaact 
ttcttctctt 
tttgagggat 



cctgaagccg 
actatgcaaa 
atacatggcc 
tttgggagca 
aaagaaaaca 
agaagccaga 
tggtcctggg 
agaacttctg 
tgtaagtcag 
aactctcagt 
acttgaaagt 
tattggcagc 
aagaggtgct 
aagtggcata 
acgacagccg 
agagcacctg 
gtggagaggg 
agagagtcaa 
aaaaatcaat 
gcacatcaat 
tgatgactcg 
atgtgtgaag 
taaggacaaa 
gacaaaagaa 
taaatatata 
ttaaaatgcg 
ggtgaaataa 
tggggtggac 



39740-000lPCT.txt 
gagaatatca tgcttaatca 
gaatctattc atgatggaac 
cctgaaatct tgatgagaag 
ttaatgtatg acatgctgac 
attgacaaaa tcctcaaatg 
gatctgctta aaaagctgct 
gacgctggag aagttcaagc 
gctcgaaagg tggagccccc 
tttgattcca agtttacacg 
gaaagtgcca atcaggtctt 
gtgaaagaaa agttttcctt 
ccacgaacac ctgtcagccc 
tcggccagca cagcaaatcc 
gagcagatgg atgtgacaat 
aactctgggc catacaaaaa 
cgtatgaatc tatgacagag 
agatgtgtga gcatcctgca 
tgtcattaca tagaacactt 
caatggtgca aaaaaaaact 
taattgattc ctcgcgacat 
aaactgacag tattaagggt 
aagggtatcc tttcattagg 
ttagcatgca agcttggtca 
acttaccaat tgatgtttta 
tttttcaaat agatttttga 
aaattattgg ttggtgtgaa 
taaaatgcaa atgaatcatt 
ctggggttta ttttcagtaa 



ccaaggtcat 
agtcacacac 
tggccacaat 
tggagcaccc 
taaactcaat 
gaaaagaaat 
tcatccattc 
ctttaaacct 
tcagacacct 
tctgggtttt 
tgaaccaaaa 
agtcaaattt 
tcagacacct 
gagtggggaa 
acaagctttt 
caatgctttt 
aggtgaaaca 
cggacacagg 
taaagcaaaa 
ctttctcaac 
aggatgttgc 
caagtacaaa 
aacttttccc 
cgtgcaaaca 
ttcagctcat 
gaaagccaga 
gttaacacag 
cccagctgcg 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2346 



<210> 322 

<211> 2420 

<212> DNA 

<213> Homo sapiens 



<400> 322 

tccggggcgg 

cggaagcgga 

ccgccccctt 

ctctgcctgg 

ctggggaccc 

gatgccgccc 

aagtgacagg 

aatacaggcc 

tgccgagcag 

ggaccccaac 

ctacgtccag 

ccacggcaag 

gttcctggtg 

cggcaaggtg 

ggtgtacttt 

ctgtacgcgc 

ctaccgcagc 

gggggagttc 

cattaagaac 
gcggcatagc 
catcgtcact 
gtcagtgctg 
atacctggag 
tgaggacaac 
ggacacgggc 
ctccactaag 
gcgagtgcct 
caagatggat 
gcacctggac 
caaaacccac 
actgaacctg 
gccccagcgg 



cccccggcag 
actctgccgg 
cccccgcctt 
cggtccctcc 
ctgcgcccct 
gcggcgaggt 
ttggctttac 
gcctggccat 
gacctgccct 
tggtacaaag 
aagcgggagg 
atcacacggg 
cgggagagca 
gagcactacc 
gagaacctca 
ctcattaaac 
ggctgggccc 
ggagacgtga 
gacgccactg 
aacctggtgc 
gagtacatgg 
ggcggagact 
ggcaacaatt 
gtggccaagg 
aagctgccag 
tctgacgtgt 
tatccaagaa 
gcccccgacg 
gccgccatgc 
gagctgcacc 
gaagatcatg 
gctggcgggc 



ccagcgcgac 
ggccgcgccg 
tcttccctcc 
tcccctctcc 
cccctccccc 
ggcccggacc 
tgtgactcgg 
ccggtacaga 
tctgcaaagg 
ccaaaaacaa 
gcgtgaaggc 
agcaggctga 
ccaactaccc 
gcatcatgta 
tgcagctggt 
caaaggtcat 
tgaacatgaa 
tgctgggcga 
cccaggcctt 
agctcctggg 
ccaaggggag 
gtctcctcaa 
tcgtgcatcg 
tcagcgactt 
tcaagtggac 
ggagtttcgg 
ttcccctgaa 
gctgcccgcc 
ggccctcctt 
tgtgacggct 
gacctggtgc 
ctttttcctg 



gttccaaaat cgaacctcag 
gctacattgt ttcctccccc 
gcgacccggg ccgtgcgtcc 
ttgcacGcat acctctttgt 
ctgaccgcat ggaccgtccc 
gcagtgcccc aagagagctc 
ggacgccaga gctcctgaga 
atgtattgcc aagtacaact 
agacgtgctc accattgtgg 
ggtgggccgt gagggcatca 
gggtaccaaa ctcagcctca 
gcggcttctg tacccgccgg 
cggagactac acgctgtgcg 
ccatgccagc aagctcagca 
ggagcactac acctcagacg 
ggagggcaca gtggcggccc 
ggagctgaag ctgctgcaga 
ttaccgaggg aacaaagtcg 
cctggctgaa gcctcagtca 
cgtgatcgtg gaggagaagg 
ccttgtggac tacctgcggt 
gttctcgcta gatgtctgcg 
agacctggct gcccgcaatg 
tggtctcacc aaggaggcgt 
agcccctgag gccctgagag 
aatccttctc tgggaaatct 
ggacgtcgtc cctcgggtgg 
cgcagtctat gaagtcatga 
cctacagctc cgagagcagc 
ggcctccgcc tgggtcatgg 
ccctgctcac tgggcccgag 
cgtcccagcc tgcacccctc 
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tggcggcgct 
cgactccctc 
gtccccctgc 
accgcacccc 
gcaggccgct 
taatggtacc 
agatgtcagc 
tccacggcac 
ccgtcaccaa 
tcccagccaa 
tgccttggtt 
agacaggcct 
tgagctgcga 
tcgacgagga 
cagatggact 
aggatgagtt 
ccatcgggaa 
ccgtcaagtg 
tgacgcaact 
gcgggctcta 
ctaggggtcg 
aggccatgga 
tgctggtgtc 
ccagcaccca 
agaagaaatt 
actcctttgg 
agaagggcta 
agaactgctg 
ttgagcacat 
gcctgtgggg 
cctgaactga 
cggccccgtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 
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tctcttggac ccacctgtgg ggcctgggga gcccactgag gggccaggga ggaaggaggc 1980 
cacggagcgg gcggcagcgc cccaccacgt cgggcttccc tggcctcccg ccactcgcct 2040 
tcttagagtt ttattccttt ccttttttga gatttttttt ccgtgtgttt attttttatt 2100 
atttttcaag ataaggagaa agaaagtacc cagcaaatgg gcattttaca agaagtacga 2160 
atcttatttt tcctgtcctg cccgtgaggt gggggggacc gggcccctct ctagggaccc 2220 
ctcgccccag cctcattccc cattctgtgt cccatgtccc gtgtctcctc ggtcgccccg 2280 
tgtttgcgct tgaccatgtt gcactgtttg catgcgcccg aggcagacgt ctgtcagggg 2340 
cttggatttc gtgtgccgct gccacccgcc cacccgcctt gtgagatgga atcgtaataa 2400 
accacgccat gaggaaaaaa 2420 

<210> 323 

<211> 2253 

<212> DNA 

<213> Homo sapiens 

<400> 323 

ggaagacttg ggtccttggg tcgcaggtgg gagccgacgg gtgggtagac cgtgggggat 60 
atctcagtgg cggacgagga cggcggggac aaggggcggc tggtcggagt ggcggagcgt 120 
caagtcccct gtcggttcct ccgtccctga gtgtccttgg cgctgccttg tgcccgccca 180 
gcgcctttgc atccgctcct gggcaccgag gcgccctgta ggatactgct tgttacttat 240 
tacagctaga ggcatcatgg accgatctaa agaaaactgc atttcaggac ctgttaaggc 300 
tacagctcca gttggaggtc caaaacgtgt tctcgtgact cagcaaattc cttgtcagaa 360 
tccattacct gtaaatagtg gccaggctca gcgggtcttg tgtccttcaa attcttccca 420 
gcgcgttcct ttgcaagcac aaaagcttgt ctccagtcac aagccggttc agaatcagaa 480 
gcagaagcaa ttgcaggcaa ccagtgtacc tcatcctgtc tccaggccac tgaataacac 540 
ccaaaagagc aagcagcccc tgccatcggc acctgaaaat aatcctgagg aggaactggc 600 
atcaaaacag aaaaatgaag aatcaaaaaa gaggcagtgg gctttggaag actttgaaat 660 
tggtcgccct ctgggtaaag gaaagtttgg taatgtttat ttggcaagag aaaagcaaag 720 
caagtttatt ctggctctta aagtgttatt taaagctcag ctggagaaag ccggagtgga 780 
gcatcagctc agaagagaag tagaaataca gtcccacctt cggcatccta atattcttag 840 
actgtatggt tatttccatg atgctaccag agtctaccta attctggaat atgcaccact 900 
tggaacagtt tatagagaac ttcagaaact ttcaaagttt gatgagcaga gaactgctac 960 
ttatataaca gaattggcaa atgccctgtc ttactgtcat tcgaagagag ttattcatag 1020 
agacattaag ccagagaact tacttcttgg atcagctgga gagcttaaaa ttgcagattt 1080 
tgggtggtca gtacatgctc catcttccag gaggaccact ctctgtggca ccctggacta 1140 
cctgccccct gaaatgattg aaggtcggat gcatgatgag aaggtggatc tctggagcct 1200 
tggagttctt tgctatgaat ttttagttgg gaagcctcct tttgaggcaa acacatacca 1260 
agagacctac aaaagaatat cacgggttga attcacattc cctgactttg taacagaggg 1320 
agccagggac ctcatttcaa gactgttgaa gcataatccc agccagaggc caatgctcag 1380 
agaagtactt gaacacccct ggatcacagc aaattcatca aaaccatcaa attgccaaaa 1440 
caaagaatca gctagcaaac agtcttagga atcgtgcagg gggagaaatc cttgagccag 1500 
ggctgccata taacctgaca ggaacatgct actgaagttt attttaccat tgactgctgc 1560 
cctcaatcta gaacgctaca caagaaatat ttgttttact cagcaggtgt gccttaacct 1620 
ccctattcag aaagctccac atcaataaac atgacactct gaagtgaaag tagccacgag 1680 
aattgtgcta cttatactgg ttcataatct ggaggcaagg ttcgactgca gccgccccgt 1740 
cagcctgtgc taggcatggt gtcttcacag gaggcaaatc cagagcctgg ctgtggggaa 1800 
agtgaccact ctgccctgac cccgatcagt taaggagctg tgcaataacc ttcctagtac 1860 
ctgagtgagt gtgtaactta ttgggttggc gaagcctggt aaagctgttg gaatgagtat 1920 
gtgattcttt ttaagtatga aaataaagat atatgtacag acttgtattt tttctctggt 1980 
ggcattcctt taggaatgct gtgtgtctgt ccggcacccc ggtaggcctg attgggtttc 2040 
tagtcctcct taaccactta tctcccatat gagagtgtga aaaataggaa cacgtgctct 2100 
acctccattt agggatttgc ttgggataca gaagaggcca tgtgtctcag agctgttaag 2160 
ggcttatttt tttaaaacat tggagtcata gcatgtgtgt aaactttaaa tatgcaaata 2220 
aataagtatc tatgtctaaa aaaaaaaaaa aaa 2253 

<210> 324 

<211> 1619 

<212> DNA 

<213> Homo sapiens 

<400> 324 

ccgccagatt tgaatcgcgg gacccgttgg cagaggtggc ggcggcggca tgggtgcccc 60 
gacgttgccc cctgcctggc agccctttct caaggaccac cgcatctcta cattcaagaa 120 
ctggcccttc ttggagggct gcgcctgcac cccggagcgg atggccgagg ctggcttcat 180 
ccactgcccc actgagaacg agccagactt ggcccagtgt ttcttctgct tcaaggagct 240 
ggaaggctgg gagccagatg acgaccccat agaggaacat aaaaagcatt cgtccggttg 300 
cgctttcctt tctgtcaaga agcagtttga agaattaacc cttggtgaat ttttgaaact 360 
ggacagagaa agagccaaga acaaaattgc aaaggaaacc aacaataaga agaaagaatt 420 
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tgaggaaact 
cctctggccg 
gtgccaccag 
caaattagat 
tgcctgtgca 

gggggrtcat 

aaggcagtgt 
gtgaatgtgt 
ggtgcctgtt 
acagtttttt 
gtgatgagag 
tattttgttt 
agccattcta 
agtgatagga 
agtgagccgc 
ctttttaaat 
tctgtcagcc 
ccaggtcccc 
gatggatttg 
gctggaaacc 



gcgaagaaag 
gagctgcctg 
ccttcctgtg 
gtttcaactg 
gcgggtgctg 
ttttgctgtt 
cccttttgct 
ctggacctca 
gaatctgagc 
tgttgttgtg 
aatggagaca 
gaattgttaa 
agtcattggg 
agcgtctggc 
ggggcacatg 
gacttggctc 
caaccttcac 
gctttctttg 
attcgccctc 
tctggaggtc 



39740-OOOlPCT. txt 
tgcgccgtgc catcgagcag ctggctgcca 
gtcccagagt ggctgcacca cttccagggt 
ggccccttag caatgtctta ggaaaggaga 
tgctcctgtt ttgtcttgaa agtggcacca 
ctggtaacag tggctgcttc tctctctctc 
ttgattcccg ggcttaccag gtgagaagtg 
agagctgaca gctttgttcg cgtgggcaga 
tgttgttgag gctgtcacag tcctgagtgt 
tgcaggttcc ttatctgtca cacctgtgcc 
tttttttgtt tttttttttt ggtagatgca 
gagtccctgg ctcctctact gtttaacaac 
ttcacagaat agcacaaact acaattaaaa 
gaaacggggt gaacttcagg tggatgagga 
agatactcct tttgccactg ctgtgtgatt 
ctggccgctc ctccctcaga aaaaggcagt 
gatgctgtgg gggactggct gggctgctgc 
atctgtcacg ttctccacac gggggagaga 
gaggcagcag ctcccgcagg gctgaagtct 
ctccctgtca tagagctgca gggtggattg 
atctcggctg ttcctgagaa ataaaaagcc 



tggattgagg 
ttattccctg 
tcaacatttt 
gaggtgcttc 
tctctttttt 
agggaggaag 
gccttccaca 
ggacttggca 
tcctcagagg 
tgacttgtgt 
atggctttct 
ctaagcacaa 
gacagaatag 
agacaggccc 
ggcctaaatc 
aggccgtgtg 
cgcagtccgc 
ggcgtaagat 
ttacagcttc 
tgtcatttc 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1619 



<210> 325 

<211> 5010 

<212> DNA 

<213> Homo sapiens 



<400> 325 

ggcggctcgg 

tcgtgtcctc 

gccataggga 

atggagcggg 

gttcttctgt 

ttggtggaga 

acagtcatgt 

aggccaatgt 

tgatcgtctt 

aaccaaaaac 

gagaggactt 

agaaactgga 

tccctcgtga 

ttcgtgaatt 

aagacagcgc 

tggagaatcc 

tccatgctaa 

ctatagtgat 

gcttaaatgc 

cagaactttc 

tcccttcctt 

ctgtccagac 

gtccctctga 

tgaagctcac 

ttaaaggctt 

gccctggagc 

tctcagatat 

ggagtgctgg 

ccctgcattt 

acttcaaggt 

tgaagcatcc 

agaaactcac 

tttctttctg 

ataaggaact 

tcgctggtca 

ggtacaacag 

aggaaatggg 

cttccagact 

aactcaatga 

aagagtctcc 

tggagaactt 



gacggaggac 

ccttcatcct 
gccgcacggg 
gccgcgagcc 
gtggcagttc 
accattgtca 
ggagatgaaa 
cacaaaacca 
tttcttgatt 
tgagtgtgag 
ccctgcagca 
cagcacagac 
ggctggatct 
taaactcagc 
tcaaaactcg 
tgggggttat 
ttttggtact 
tgtcagagca 
aattggtgtg 
attctttgga 
caatcacact 
aatctccaga 
ctggaaaaca 
tgtgaggaat 
tgtagaacca 
tgcaaaatcc 
ggtcttaaaa 
agactttgga 
aaaggctttc 
litctgccagc 
ggttactggg 
tttagacaat 
tttttgcgag 
gattgagagg 
gttcgtgatt 
ccaactgctt 
cctgagttta 
aacaacagat 
tcgtgtcatg 
tttccgacat 
gaaactgcgt 



gcgctagtgt 
gcggggctgg 
gagcgggaaa 
tgtggggaag 
agaatgatgg 
tatacccggt 
cttgctgtag 
aaaaggtgta 
ggatttatga 
agactggcag 
cgtcgcttat 
ttcaccagca 
caaaaagatg 
aaagtctggc 
gtgatcatag 
gtggcgtata 
aaaaaagatt 
gggaaaatca 
ttgatataca 
catgctcatc 
cagtttccac 
gctgctgcag 
gactctacat 
gtgctgaaag 
gatcactatg 
ggtgtaggca 
gatgggtttc 
tcggttggtg 
acttatatta 
ccactgttgt 
caatttctat 
gctgctttcc 
gacacagatt 
attcctgagt 
aaactaaccc 
tcatttgtga 
cagtggctgt 
ttcgggaatg 
agagtggagt 
gtcttctggg 
aaacaaaata 



gagtgcgggc ttctagaact 
ctggagcggc cgctccggtg 
gcggtcgcgg ccccaggcgg 
gggctgtggc ggcgcctcga 
atcaagctag atcagcattc 
tcagcctggc tcggcaagta 
atgaagaaga aaatgctgac 
gtggaagtat ctgctatggg 
ttggctactt gggctattgt 
gaaccgagtc tccagtgagg 
^tttgggatga cctgaagaga 
ccatcaagct gctgaatgaa 
aaaatcttgc gttgtatgtt 
gtgatcaaca ttttgttaag 
ttgataagaa cggtagactt 
gtaaggctgc aacagttact 
ttgaggattt atacactcct 
cctttgcaga aaaggttgca 
tggaccagac taaatttccc 
tggggacagg tgacccttac 
catctcggtc atcaggattg 
aaaagctgtt tgggaatatg 
gtaggatggt aacctcagaa 
agataaaaat tcttaacatc 
ttgtagttgg ggcccagaga 
cagctctcct attgaaactt 
agcccagcag aagcattatc 
ccactgaatg gctagaggga 
atctggataa agcggttctt. 
atacgcttat tgagaaaaca 
atcaggacag caactgggcc 
ctttccttgc atattctgga 
atccttattt gggtaccacc 
tgaacaaagt ggcacgagca 
atgatgttga attgaacctg 
gggatctgaa ccaatacaga 
attctgctcg tggagacttc 
ctgagaaaac agacagattt 
atcacttcct ctctccctac 
gctccggctc tcacacgctg 
acggtgcttt taatgaaacg 
Page 70 



acaccgaccc 
ctgtccagca 
ggcggccggg 
gcggctgcag 
tctaacttgt 
gatggcgata 
aataacacaa 
actattgctg 
aaaggggtag 
gaggagccag 
aagttgtcgg 
aattcatatg 
gaaaatcaat 
attcaggtca 
gtttacctgg 
ggtaaactgg 
gtgaatggat 
aatgctgaaa 
attgttaacg 
acacctggat 
cctaatatac 
gaaggagact 
agcaagaatg 
tttggagtta 
gatgcatggg 
gcccagatgt 
tttgccagtt 
tacctttcgt 
ggtaccagca 
atgcaaaatg 
agcaaagttg 
atcccagcag 
atggacacct 
gctgcagagg 
gactatgaga 
gcagacataa 
ttccgtgcta 
gtcatgaaga 
gtatctccaa 
ccagctttac 
ctgttcagaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 
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39740-OOOlPCT-txt 

accagttggc tctagctact tggactattc agggagctgc aaatgccctc tctggtgacg 2520 
tttgggacat tgacaatgag ttttaaatgt gatacccata gcttccatga gaacagcagg 2580 
gtagtctggt ttctagactt gtgctgatcg tgctaaattt tcagtagggc tacaaaacct 2640 
gatgttaaaa ttccatccca tcatcttggt actactagat gtctttaggc agcagctttt 2700 
aatacagggt agataacctg tacttcaagt taaagtgaat aaccacttaa aaaatgtcca 2760 
tgatggaata ttcccctatc tctagaattt taagtgcttt gtaatgggaa ctgcctcttt 2820 
cctgttgttg ttaatgaaaa tgtcagaaac cagttatgtg aatgatctct ctgaatccta 2880 
agggctggtc tctgctgaag gttgtaagtg gttcgcttac tttgagtgat cctccaactt 2940 
catttgatgc taaataggag ataccaggtt gaaagacctc tccaaatgag atctaagcct 3000 
ttccataagg aatgtagcag gtttcctcat tcctgaaaga aacagttaac tttcagaaga 3060 
gatgggcttg ttttcttgcc aatgaggtct gaaatggagg tccttctgct ggataaaatg 3120 
aggttcaact gttgattgca ggaataaggc cttaatatgt taacctcagt gtcatttatg 3180 
aaaagagggg accagaagcc aaagacttag tatattttct tttcctctgt cccttccccc 3240 
ataagcctcc atttagttct ttgttatttt tgtttcttcc aaagcacatt gaaagagaac 3300 
cagtttcagg tgtttagttg cagactcagt ttgtcagact ttaaagaata atatgctgcc 3360 
aaattttggc eaaagtgtta atcttagggg agagctttct gtccttttgg cactgagata 3420 
tttattgttt atttatcagt gacagagttc actataaatg gtgttttttt aatagaatat 3480 
aattatcgga agcagtgcct tccataatta tgacagttat actgtcggtt ttttttaaat 3540 
aaaagcagca tctgctaata aaacccaaca gatactggaa gttttgcatt tatggtcaac 3600 
acttaagggt tttagaaaac agccgtcagc caaatgtaat tgaataaagt tgaagctaag 3660 
atttagagat gaattaaatt taattagggg ttgctaagaa gcgagcactg accagataag 3720 
aatgctggtt ttcctaaatg cagtgaattg tgaccaagtt ataaatcaat gtcacttaaa 3780 
ggctgtggta gtactcctgc aaaattttat agctcagttt atccaaggtg taactctaat 3840 
tcccatttgc aaaatttcca gtacctttgt cacaatccta acacattatc gggagcagtg 3900 
tcttccataa tgtataaaga acaaggtagt ttttacctac cacagtgtct gtatcggaga 3960 
cagtgatctc catatgttac actaagggtg taagtaatta tcgggaacag tgtttcccat 4020 
aattttcttc atgcaatgac atcttcaaag cttgaagatc gttagtatct aacatgtatc 4080 
ccaactccta taattcccta tcttttagtt ttagttgcag aaacattttg tggtcattaa 4140 
gcattgggtg ggtaaattca accactgtaa aatgaaatta ctacaaaatt tgaaatttag 4200 
cttgggtttt tgttaccttt atggtttctc caggtcctct acttaatgag atagcagcat 4260 
acatttataa tgtttgctat tgacaagtca ttttaattta tcacattatt tgcatgttac 4320 
ctcctataaa cttagtgcgg acaagtttta atccagaatt gaccttttga cttaaagcag 4380 
agggactttg tatagaaggt ttgggggctg tggggaagga gagtcccctg aaggtctgac 4440 
acgtctgcct acccattcgt ggtgatcaat taaatgtagg tatgaataag ttcgaagctc 4500 
cgtgagtgaa ccatcatata aacgtgtagt acagctgttt gtcatagggc agttggaaac 4560 
ggcctcctag ggaaaagttc atagggtctc ttcaggttct tagtgtcact tacctagatt 4620 
tacagcctca cttgaatgtg tcactactca cagtctcttt aatcttcagt tttatcttta 4680 
atctcctctt ttatcttgga ctgacattta gcgtagctaa gtgaaaaggt catagctgag 4740 
attcctggtt cgggtgttac gcacacgtac ttaaatgaaa gcatgtggca tgttcatcgt 4800 
ataacacaat atgaatacag ggcatgcatt ttgcagcagt gagtctcttc agaaaaccct 4860 
tttctacagt tagggttgag ttacttccta tcaagccagt acgtgctaac aggctcaata 4920 
ttcctgaatg aaatatcaga ctagtgacaa gctcctggtc ttgagatgtc ttctcgttaa 4980 
ggagtagggc cttttggagg taaaggtata 5010 



<210> 326 

<211> 2574 

<212> DNA 

<213> Homo sapiens 

<400> 326 

cctgtttaga cacatggaca acaatcccag cgctacaagg cacacagtcc gcttcttcgt 60 
cctcagggtt gccagcgctt cctggaagtc ctgaagctct cgcagtgcag tgagttcatg 120 
caccttcttg ccaagcctca gtctttggga tctggggagg ccgcctggtt ttcctccctc 180 
cttctgcacg tctgctgggg tctcttcctc tccaggcctt gccgtccccc tggcctctct 240 
tcccagctca cacatgaaga tgcacttgca aagggctctg gtggtcctgg ccctgctgaa 300 
ctttgccacg gtcagcctct ctctgtccac ttgcaccacc ttggacttcg gccacatcaa 360 
gaagaagagg gtggaagcca ttaggggaca gatcttgagc aagctcaggc tcaccagccc 420 
ccctgagcca acggtgatga cccacgtccc ctatcaggtc ctggcccttt acaacagcac 480 
ccgggagctg ctggaggaga tgcatgggga gagggaggaa ggctgcaccc aggaaaacac 540 
cgagtcggaa tactatgcca aagaaatcca taaattcgac atgatccagg ggctggcgga 600 
gcacaacgaa ctggctgtct gccctaaagg aattacctcc aaggttttcc gcttcaatgt 660 
gtcctcagtg gagaaaaata gaaccaacct attccgagca gaattccggg tcttgcgggt 720 
gcccaacccc agctctaagc ggaatgagca gaggatcgag ctcttccaga tccttcggcc 780 
agatgagcac attgccaaac agcgctatat cggtggcaag aatctgccca cacggggcac 840 
tgccgagtgg ctgtcctttg atgtcactga cactgtgcgt gagtggctgt tgagaagaga 900 
gtccaactta ggtctagaaa tcagcattca ctgtccatgt cacacctttc agcccaatgg 960 
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agatatcctg 
ggatgaccat 
tcatctaatc 
gaggaagaag 
tgtgcgcccc 
taagggctac 
aacccacagc 
ttgctgcgtg 
caaagtggag 
gtgcgacaga 
acacaagcaa 
ggctgagatg 
ggtaaagaaa 
ctgtgacgca 
tggcaatggg 
ggccagactg 
gctctaggga 
acgaagacaa 
cttttgcaaa 
ggtcatgcag 
gaaagggtgg 
tcgatcatat 
tggattgtgg 
accctgtgtt 
agctgcacat 
agtatgaata 
atttcatctt 



gaaaacattc 
ggccgtggag 
ctcatgatga 

cgggctttgg 

ctctacattg 
tatgccaact 
acggtgctgg 
ccccaggacc 
cagctctcca 
gagaggggag 
caaacctcac 
gaggtttcct 
gtgtgggttt 
gacagagggg 
atttgggcta 
gaagacactt 
atctggatta 
agtcccagaa 
ctgtcctcta 
ttcctggccc 
aaatcaaccc 
ttccccttgg 
ttccatgcag 
catttggtgt 
gtgccacaca 
ttactctcaa 
ctggaatatt 



39740-OOOlPCT . txt 
acgaggtgat ggaaatcaaa ttcaaaggcg 
atctggggcg cctcaagaag cagaaggatc 
ttcccccaca ccggctcgac aacccgggcc 
acaccaatta ctgcttccgc aacttggagg 
acttccgaca ggatctgggc tggaagtggg 
tctgctcagg cccttgccca tacctccgca 
gactgtacaa cactctgaac cctgaagcat 
tggagcccct gaccatcctg tactatgttg 
acatggtggt gaagtcttgt aaatgtagct 
agagaaccac cactgcctga ctgcccgctc 
tgagaggcct ggagcccaca accttcggct 
tttggaacat ttctttcttg ctggctctga 
ggttagagga aggctgaact cttcagaaca 
atggggatag aggaaaggga tggtaagttg 
ccctaaaggg agaaggaagg gcagagaatg 
cagatctgag gttggatttg ctcattgctg 
tgttatacaa ggcaagcatt ttttttttta 
ttgtatctca tactgtctgg gattaagggc 
catcaattaa catcgtgggt cactacaggg 
atcaactgta ttgggccttt tggatatgct 
tctcctgtct gccctctggg tccctcctct 
acacttggtt agacgccttc caggtcagga 
ccttggggca ttatgggtct tcccccactt 
tcctggaagc aggtgctaca acatgtgagg 
gtgacttggc cccagacgca tagactgagg 
aatctttgta taaataaata tttttggggc 
gtttctagaa cagtaaaagc cttattctaa 



tggacaatga 
accacaaccc 
aggggggtca 
agaactgctg 
tccatgaacc 
gtgcagacac 
ctgcctcgcc 
ggaggacccc 
gagaccccac 
ctcgggaaac 
ccgggcaaat 
gaatcacggt 
cacagacttt 
agatgttgtg 
gctgggtcag 
taccacatct 
aagacaggtt 
aaatctatta 
agaaaatcca 
gaacgcagaa 
cacctctccc 
tgcacatttc 
cccctccaag 
cattcgggga 
tataaagaca 
atcctggatg 

ggtg 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2574 



<210> 327 

<211> 1421 

<212> DNA 

<213> Homo sapiens 



<400> 327 

acttactgcg 

agctgcatta 

caggtgattc 

cgcttccaga 

agcagcagct 

cgagacgtgg 

ttccctgaca 

gctgcactgg 

ctggccgaga 

tataccaaga 

cactccgtgt 

aacaaagaga 

tttgccccac 

ccgctccctt 

caggaggaag 

tgtgtggctg 

cttccctctc 

ggatctggca 

cctgcttctt 

gccagcacct 

cccaccctcc 

cccttcctac 

ggcaccaacc 

agggtccacg 



ggacggcctt 
acctgcccac 
tcgggccgat 
ttgctcagta 
tctgcacaca 
cccaggaggc 
tcatggagtt 
atgggacctt 
gcgtggtgaa 
ggctcggcac 
gtcggctctg 
actgcccagt 
agcagattct 
cctgccactg 
tcgggaggcg 
ccccacctgc 
agctgctggg 
cactccctct 
cccctctgcg 
ttgagccttg 
cctgaggatg 
ctctggtgat 
ttgctgggac 
cctctgctgt 



ggagagtact 
tgtgctgccc 
gttctcagga 
caagtgcctg 
tgaccggaac 
cctgggcgtg 
ctgcgaggcc 
ccagaggaag 
gctgacggcg 
agagaaggag 
ctacttcaag 
gccaggaaag 
gcaatgcagc 
ccgcctactg 
tggagggtga 
cgcatgctcc 
acgatcgccc 
ccttggggtg 
gctttcactg 
gcccacactg 
gcctggattc 
ggtttccaca 
ttggatccca 
agcttatgaa 



cgggttcgtg 
ggctccccca 
aaaagcacag 
gtgatcaagt 
accatggagg 
gctgtcatag 
atggccaacg 
ccatttgggg 
gtgtgcatgg 
gtcgaggtga 
aaggcctcag 
ccaggggaag 
cctgccaact 
gacgctgccc 
ccacaccttg 
ctcctctcct 
aggctggagc 
agggacagag 
ctgagtttct 
aggcttaggc 
acgccctctt 
ggaacaacag 
ggggcttatc 
attaactaat 



aacttcccgg 
gcaagacccg 
agttgatgag 
atgccaaaga 
cgctgcccgc 
gcatcgacga 
ccgggaagac 
ccatcctgaa 
agtgcttccg 
ttgggggagc 
gccagcctgc 
ccgtggctgc 
gagggacctg 
tgcatgctgc 
gccttctggg 
acccactggt 
tggccccgct 
ccccacgctg 
gttctccctg 
ctctctgcct 
gtttcctttt 
catctttcac 
tcttcaagtg 
t 



aggcgcaatg 
ggggcagatc 
acgcgtccgt 
cactcgctac 
ctgcctgctc 
ggggcagttt 
cgtaattgtg 
cctggtgccg 
ggaagccgcc 
agacaagtac 
cgggccggac 
caggaagctc 
caagggccgc 
ccagccactc 
aactctcctt 
ctgcttaaag 
tggtggcctg 
ttgacatcag 
ggaagcctgt 
gggatgggct 
gggctcaaag 
caagatgggt 
tggagagggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1421 



<210> 328 

<211> 4604 

<212> DNA 

<213> Homo sapiens 

<400> 328 ' 
ggaacagctt gtccacccgc cggccggacc agaagccttt gggtctgaag tgtctgtgag 60 
acctcacaga agagcacccc tgggctccac ttacctgccc cctgctcctt cagggatgga 120 
ggcaatggcg gccagcactt ccctgcctga ccctggagac tttgaccgga acgtgccccg 180 
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gatctgtggg gtgtgtggag accgagccac 
aggctgcaaa ggcttcttca. ggcgaagcat 
caacggggac tgccgcatca ccaaggacaa 
acgctgtgtg gacatcggca tgatgaagga 
gaagcgggag atgatcctga agcggaagga 
caagctgtct gaggagcagc agcgcatcat 
ctacgacccc acctactccg acttctgcca 
tggagggagc catccttcca ggcccaactc 
ctcctcctcc tgctcagatc actgtatcac 
ctccaatctg gatctgagtg aagaagattc 
ccagctctcc atgctgcccc acctggctga 
tggctttgct aagatgatac caggattcag 
gctgaagtca agtgccattg aggtcatcat 
cgacatgtcc tggacctgtg gcaaccaaga 
agccggacac agcctggagc tgattgagcc 
gctgaacttg catgaggagg agcatgtcct 
tcgtcctggg gtgcaggacg ccgcgctgat 
actgcagacg tacatccgct gccgccaccc 
gatgatccag aagctagccg acctgcgcag 
ctgcctctcc ttccagcctg agtgcagcat 
tggcaatgag atctcctgac taggacagcc 
agggccacgt gccaggcccg gggctggcgg 
gttcagcccc tcctctgcca cctcccctat 
ctaacccctt tcctgcgggc ttttccccgg 
tgtttgtttg acaaagaaac ccaagtgggg 
cccagagatg cctccaccgc tgcctaagtg 
gagaaatgca tccattcctc agggacagag 
gcttgtccag cgcctagtgg ggtctccctc 
cacagctccc accccacccc cttcagtgcc 
tcacgggcag tagctgtggt gaggtgggtt 
acccacctgc tgagagaccc aaggaggaaa 
tgacagctgt ccctgtcacc aagctcacag 
aggtggaagc cctccttcca cggatccatg 
aaaagcacct gccgacctcg tcctccccct 
cagccctccc tgtcctcctc ggatcaccga 
tccccagggg agagggtctg gagaagcagt 
gatggaggag aagaattttc agaccccagc 
tgtggttgca aggccgctgt tcaccacagg 
gtgggaaggg agagcggggc agtctcgggt 
gtgatgtagg gtaaggtgcc ttcttattct 
tgtgaggcag gggcggagtg atacaacttc 
cccagctggt gggaagcgtc tgtccgttta 
gtaggtgtgc gggaccggta cagaaaggcg 
agatcaatgc ttgagtttgg aatcggccgc 
agtgggaacg tgactgcccc aactcctgga 
ccctgaaaac ccagagagga atcagacttc 
cccatgtctc tcagaattct tcaggtggaa 
gaatagcata tatatcgctt aatcttaaat 
actccatttg tattatagtc taatatacag 
tatgggggga gaacttacat tgtgaaactt 
tttacaaggg tctagggaga gacccttgtt 
gcttgctctt cagtgggaga aaaacacttg 
caggaaaact gacagaggag ggcgtgactc 
ggccaggaca ggccgggcgc ggtggctcac 
ggtaggtgga tcacctgagg tcgggagttc 
tgtctctatt aaaaatacaa aaaaaaaaaa 
gcctgtaatc ccagctactc aggaggctga 
ttgcagtgag ctgagatcgt gccgttactc 
tcttagaagt ggaccaggac aggaccagat 
tacaccatgt ttgagctcag acccccactc 
ggggaagccc tggatttcag aaagagccaa 
ctggcttgta actccaccaa gcccatcaga 
aatgtgaatc agaccctacc ccaccacgat 
gccttggata atgctgttgc ctcatctata 
ttcccagctc tccctctggc cctgcttctt 
ccctgggccc caccacctag gccactcctc 
aaaagagggg aaaatgagtt tttatggggc 
tactttagaa tgagagtgtg aaatagacat 
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tggctttcac ttcaatgcta tgacctgtga 240 

gaagcggaag gcactattca cctgcccctt 300 

ccgacgccac tgccaggcct gccggctcaa 360 

gttcattctg acagatgagg aagtgcagag 420 

ggaggaggcc ttgaaggaca gtctgcggcc 480 

tgccatactg ctggacgccc accataagac 540 

gttccggcct ccagttcgtg tgaatgatgg 600 

cagacacact cccagcttct ctggggactc 660 

ctcttcagac atgatggact cgtccagctt 720 

agatgaccct tctgtgaccc tagagctgtc 780 

cctggtcagt tacagcatcc aaaaggtcat 840 

agacctcacc tctgaggacc agatcgtact 900 

gttgcgctcc aatgagtcct tcaccatgga 960 

ctacaagtac cgcgtcagtg acgtgaccaa 1020 

cctcatcaag ttccaggtgg gactgaagaa 1080 

gctcatggcc atctgcatcg tctccccaga 1140 

tgaggcQatc caggaccgcc tgtccaacac 1200 

gcccccgggc agccacctgc tctatgccaa 1260 

cctcaatgag gagcactcca agcagtaccg 1320 

gaagctaacg ccccttgtgc tcgaagtgtt 1380 

tgtgcggtgc ctgggtgggg ctgctcctcc 1440 

ctactcagca gccctcctca cccgtctggg 1500 

ccacccagcc cattctctct cctgtccaac 1560 

tcccttgaga cctcagccat gaggagttgc 1620 

gcagagggca gaggctggag gcaggccttg 1680 

gctgctgact gatgttgagg gaacagacag 1740 

acacctgcac ctccccccac tgcaggcccc 1800 

tcctgcctta ctcacgataa ataatcggcc 1860 

caccaacatc ccattgccct ggttatattc 1920 

ttcttcccat cactggagca ccaggcacga 1980 

aacagacaaa aacagcctca cagaagaata 2040 

ttcctcgccc tgggtctaag gggttggttg 2100 

tagcaggact gaattgtccc cagtttgcag 2160 

gccagtgcct tacctcctgc ccaggagagc 2220 

gagtagccga gagcctgctc ccccaccccc 2280 

gagccgcatc ttctccatct ggcagggtgg 2340 

ggctgagtca tgatctccct gccgcctcaa 2400 

gctaagagct aggctgccgc accccagagt 2460 

ggctagtcag agagagtgtt tgggggttcc 2520 

cactccacca cccaaaagtc aaaaggtgcc 2580 

aagtgcatgc tctctgcagg tcgagcccag 2640 

ctccaaggtg ggtctttgtg agagtgagct 2700 

ttcttcgagg tggatcacag aggcttcttc 2760 

attccctgag tcaccaggaa tgttaaagtc 2820 

agctgtgtcc ttgcacctgc atccgtagtt 2880 

acactgcaag agccttggtg tccacctggc 2940 

aaacatctga aagccacgtt ccttactgca 3000 

ttattagata tgagttgttt tcagactcag 3060 

ggtagcaggt accactgatt tggagatatt 3120 

ctgtacatta attattattg ctgttgttat 3180 

tgattttagc tgcagaactg tattggtcca 3240 

taagttgcta aacgagtcaa tcccctcatt 3300 

acccaagcca tatataacta gctagaagtg 3360 

gcctgtaatc ccagcagttt gggaggtcga 3420 

gagaccaacc tgaccaacat ggagaaaccc 3480 

aaaaaaaaat agccgggcat ggtggcgcaa 3540 

ggcagaagaa ttgaacccag gaggtggagg 3600 . 

tccaacctgg acaacaagag cgaaactccg 3660 

tttggagtca tggtccggtg tccttttcac 3720 

tcattcccca ggtggctgac ccagtccctg 3780 

gtctggatct gggacccttt ccttccttcc 3840 

aggagaagga aggagactca cctctgcctc 3900 

gtgccctggc tgctgggctc tccacctcag 3960 

acatgcattt gtctttgtaa tgtcaccacc 4020 

cggggaactc ctgaaatatc agttactcag 4080 

caaaggaagt ctaggagctg ggaggaaaag 4140 

tgaacgggga gaaaaggtca tcatcgattc 4200 

ttgtaaatgt aaaactttta aggtatatca 4260 
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ttataactga 
ggaaaatcct 
gaaggcagga 
aacagtgcct 
aaccaatgta 
ctttgtttgc 



aggagaaggt 
ctggctggct 
atgtgtggca 
gctgaatgat 
gaaagctttg 
aagtactgct 



gccccaaaat 
aactggaagc 
gatttagtga 
ttccaaagag 
cttatggtaa 
gtaaataaat 



39740-000lPCT.txt 
gcaagatttt ccacaagatt 
atgtaggaga atccaagcga 
aagctagaga tatggcagcg 
aaaaaaagtt tgccagaagt 
taaaaatggc tcatacttat 
gctttatgca aacc 



cccagagaca 
ggtcaacaga 
aaaggatgta 
ttgtcaagtc 
atagcactta 



4320 
4380 
4440 
4500 
4560 
4604 



<210> 329 

<211> 2076 

<212> DNA 

<213> Homo sapiens 



<400> 329 

cggggaaggg 

gcggggtgtt 

ggcggcgtcc 

gtccggtttc 

stgggcgccct 

gcccaggggg 

cgcaggggcg 

tccttccacc 

gctgctcccg 

cctctcggac 

ggagcagtta 

ttggaaaatg 

caacctcaac 

gatcttgaaa 

tatagatgtg 

gtccgtctac 

cacgagctac 

accagtaaca 

ttacagacaa 

ggcagcgaac 

ggctcaggaa 

tgacatctgt 

ggggcttcgg 

tgtctgtaaa 

cacatgccag 

atgtgcctgt 

ccaccaaaca 

aggattttca 

aatgagctaa 

gccacagtag 

aagtctgtct 

aaaaggcctc 

tgtgatttct 

ttcattttta 

caaaatatgt 



gagggaggag 
ctggtgtccc 
tccctcgccc 
ctgtgaggct 
gcaaagttgg 
ggtcgccggg 
cccgcgcccc 
atgcacttgc 
ggtcctcgcg 
gcggagcccg 
cggtctgtgt 
tacaagtgtc 
tcaaggacag 
agtattgata 
gggaaggagt 
agatgtgggg 
ctcagcaaga 
atcagttttg 
gttcattcca 
aagacctgcc 
gattttatgt 
ggaccaaaca 
cctgccagct 
aacaaactct 
tgtgtatgta 
gaatgtacag 
tgcagctgtt 
tatagtgaag 
gattgtactg 
aactgtctgt 
ttcctgaacc 
ttgtaaagac 
ttaaaagaat 
tagcaacaac 
ttaaaataaa 



ggggacgagg 
ccgccccgcc 
tcgcttcacc 
tttacctgac 
gaacgcggag 
aggagcccgg 
cacccctgcc 
tgggcttctt 
aggcgcccgc 
acgcgggcga 
ccagtgtaga 
agctaaggaa 
aagagactat 
atgagtggag 
ttggagtcgc 
gttgctgcaa 
cgttatttga 
ccaatcacac 
ttattagacg 
ccaccaatta 
tttcctcgga 
aggagctgga 
gtggacccca 
tccccagcca 
aaagaacctg 
aaagtccaca 
acagacggcc 
aagtgtgtcg 
ttttccagtt 
gaacagagag 
atgtggataa 
tggttttctg 
gactatataa 
aattggtaaa 
atgaaaattg 



gctctggcgg 
tctccaaaaa 
tcgcgggctc 
acccgccgcc 
ccccggaccc 
gggagaggga 
cccgccagcg 
ctctgtggcg 
cgccgccgcc 
ggccacggct 
tgaactcatg 
aggaggctgg 
aaaatttgct 
aaagactcaa 
gacaaacacc 
tagtgagggg 
aattacagtg 
ttcctgccga 
ttccctgcca 
catgtggaat 
tgctggagat 
tgaagagacc 
caaagaacta 
atgtggggcc 
ccccagaaat 
gaaatgcttg 
atgtacgaac 
ttgtgtccct 
catcgatttt 
acccttgtgg 
ctttacagaa 
ccaatgacca 
tttatttcca 
actcactgtg 
tattat 



gtttggaggg 
gctacaccga 
cgaatgcggg 
tttccccggc 
gctcccgccg 
ccaggagggg 
gaccggtccc 
tgttctctgc 
gccttcgagt 
tatgcaagca 
actgtactct 
caacataaca 
gcagcacatt 
tgcatgccac 
ttctttaaac 
ctgcagtgca 
cctctctctc 
tgcatgtcta 
gcaacactac 
aatcacatct 
gactcaacag 
tgtcagtgtg 
gacagaaact 
aaccgagaat 
caacccctaa 
ttaaaaggaa 
cgccagaagg 
tcatattgga 
ctattatgga 
gtccatgcta 
atggactgga 
aacagccaag 
ctaaaaatat 
atcaatattt 



gctgaacatc 
cgcggaccgc 
gagctcggat 
actggctggg 
cctccggctc 
cccgcggcct 
ccacccccgg 
tcgccgctgc 
ccggactcga 
aagatctgga 
acccagaata 
gagaacaggc 
ataatacaga 
gggaggtgtg 
ctccatgtgt 
tgaacaccag 
aaggccccaa 
aactggatgt 
cacagtgtca 
gcagatgcct 
atggattcca 
tctgcagagc 
catgccagtg 
ttgatgaaaa 
atcctggaaa 
agaagttcca 
cttgtgagcc 
aaagaccaca 
aaactgtgtt 
acaaagacaa 
gctcatctgc 
attttcctct 
tgtttctgca 
ttatatcatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2076 



<210> 330 

<211> 2819 

<212> DNA 

<213> Homo sapiens 



<400> 330 

ctgggcccag 

ccactgacgt 

agccgccagc 

cccagctcca 

cccgccatcc 

ctgtaagatg 

ccaccggggc 

tgcacaggtg 

cttccagcct 

actgtgcctc 

acctggccac 

cgcaccccgt 

ctgcjatagcc 



ctcccccgag 
ccaggcatga 
accgtcctgg 
ctggaggaca 
ccaccccgct 
tgcgctcagc 
ctctactgtg 
gtcggtgtgg 
aactgcaagt 
cgagtgcgcc 
tgctgtgagc 
gacacaggag 
tacacaagcc 



aggtggtcgg 
ggtggttcct 
ccacggccct 
cctcctcacg 
gcccgctggg 
agcttgggga 
actacagcgg 
gctgcgtcct 
acaactgcac 
ccccgcgtct 
agtgggtatg 
ccttcgatgc 
cctggagccc 



atcctctggg ctgctcggtc 
gccctggacg ctggcagcag 
ctctccagcc cctacgacca 
cccccaattc tgcaagtggc 
ggtcagcctc atcacagatg 
caactgcacg gaggctgcca 
ggaccgcccg aggtacgcaa 
ggatggggtg cgctacaaca 
gtgcatcgac ggcgcggtgg 
ctggtgcccc cacccgcggc 
tgaggacgac gccaagaggc 
tgtgggtgag gtggaggcat 
ttgctccacc agctgcggcc 
Page 74 . 



gatgcctgtg 60 
tgacagcagc 120 
tggactttac 180 
catgtgagtg 240 
gctgtgagtg 300 
tctgtgaccc 360 
taggagtgtg 420 
acggccagtc 480 
gctgcacacc 540 
gcgtgagcat 600 
cacgcaagac 660 
ggcacaggaa 720 
tgggggtctc 780 
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cactcggatc tccaatgtta acgcccagtg ctggcctgag caagagagcc gcctctgcaa 840 
cttgcggcca tgcgatgtgg acatccatac actcattaag gcagggaaga agtgtctggc 900 
tgtgtaccag ccagaggcat ccatgaactt cacacttgcg ggctgcatca gcacacgctc 960 
ctatcaaccc aagtactgtg gagtttgcat ggacaatagg tgctgcatcc cctacaagtc 1020 
taagactatc gacgtgtcct tccagtgtcc tgatgggctt ggcttctccc gccaggtcct 1080 
atggattaat gcctgcttct gtaacctgag ctgtaggaat cccaatgaca tctttgctga 1140 
cttggaatcc taccctgact tctcagaaat tgccaactag gcaggcacaa atcttgggtc 1200 
ttggggacta acccaatgcc tgtgaagcag tcagccctta tggccaataa cttttcacca 1260 
atgagcctta gttaccctga tctggaccct tggcctccat ttctgtctct aaccattcaa 1320 
atgacgcctg atggtgctgc tcaggcccat gctatgagtt ttctccttga tatcattcag 1380 
catctactct aaagaaaaat gcctgtctct agctgttctg gactacaccc aagcctgatc 1440 
cagcctttcc aagtcactag aagtcctgct ggatcttgcc taaatcccaa gaaatggaat 1500 
caggtagact tttaatatca ctaatttctt ctttagatgc caaaccacaa gactctttgg 1560 
gtccattcag atgaatagat ggaatttgga acaatagaat aatctattat ttggagcctg 1620 
ccaagaggta ctgtaatggg taattctgac gtcagcgcac caaaactatc ctgattccaa 1680 
atatgtatgc acctcaaggt catcaaacat ttgccaagtg agttgaatag ttgcttaatt 1740 
ttgattttta atggaaagtt gtatccatta acctgggcat tgttgaggtt aagtttctct 1800 
tcacccctac actgtgaagg gtacagatta ggtttgtccc agtcagaaat aaaatttgat 1860 
aaacattcct gttgatggga aaagccccca gttaatactc cagagacagg gaaaggtcag 1920 
cccgtttcag aaggaccaat tgactctcac actgaatcag ctgctgactg gcagggcttt 1980 
gggcagttgg ccaggctctt ccttgaatct tctcccttgt cctgcttggg gttcatagga 2040 
attggtaagg cctctggact ggcctgtctg gcccctgaga gtggtgccct ggaacactcc 2100 
tctactctta cagagccttg agagacccag ctgcagacca tgccagaccc actgaaatga 2160 
ccaagacagg ttcaggtagg ggtgtgggtc aaaccaagaa gtgggtgccc ttggtagcag 2220 
cctggggtga cctctagagc tggaggctgt gggactccag gggcccccgt gttcaggaca 2280 
catctattgc agagactcat ttcacagcct ttcgttctgc tgaccaaatg gccagttttc 2340 
tggtaggaag atggaggttt accggttgtt tagaaacaga aatagactta ataaaggttt 2400 
aaagctgaag aggttgaagc taaaaggaaa aggttgttgt taatgaatat caggctatta 2460 
tttattgtat taggaaaata taatatttac tgttagaatt cttttattta gggccttttc 2520 
tgtgccagac attgctctca gtgctttgca tgtattagct cactgaatct tcacgacaat 2580 
gttgagaagt tcccattatt atttctgttc ttacaaatgt gaaacggaag ctcatagagg 2640 
tgagaaaact caaccagagt cacccagttg gtgactggga aagttaggat tcagatcgaa 2700 
attggactgt ctttataacc catattttcc ccctgttttt agagcttcca aatgtgtcag 2760 
aataggaaaa cattgcaata aatggcttga ttttttaaaa aaaaaaaaaa aaaaaaaaa 2819 

<210> 331 

<211> 2540 - 

<212> DNA 

<213> Homo sapiens 

<400> 331 

gaaaaggtgg acaagtccta ttttcaagag aagatgactt ttaacagttt tgaaggatct 60 
aaaacttgtg tacctgcaga catcaataag gaagaagaat ttgtagaaga gtttaataga 120 
ttaaaaactt ttgctaattt tccaagtggt agtcctgttt cagcatcaac actggcacga 180 
gcagggtttc tttatactgg tgaaggagat accgtgcggt gctttagttg tcatgcagct 240 
gtagatagat ggcaatatgg agactcagca gttggaagac acaggaaagt atccccaaat 300 
tgcagattta tcaacggctt ttatcttgaa aatagtgcca cgcagtctac aaattctggt 360 
atccagaatg gtcagtacaa agttgaaaac tatctgggaa gcagagatca ttttgcctta 420 
gacaggccat ctgagacaca tgcagactat cttttgagaa ctgggcaggt tgtagatata 480 
tcagacacca tatacccgag gaaccctgcc atgtattgtg aagaagctag attaaagtcc 540 
tttcagaact ggccagacta tgctcaccta accccaagag agttagcaag tgctggactc 600 
tactacacag gtattggtga ccaagtgcag tgcttttgtt gtggtggaaa actgaaaaat 660 
tgggaacctt gtgatcgtgc ctggtcagaa cacaggcgac actttcctaa ttgcttcttt 720 
gttttgggcc ggaatcttaa tattcgaagt gaatctgatg ctgtgagttc tgataggaat 780 
ttcccaaatt caacaaatct tccaagaaat ccatccatgg cagattatga agcacggatc 840 
tttacttttg ggacatggat atactcagtt aacaaggagc agcttgcaag agctggattt 900 
tatgctttag gtgaaggtga taaagtaaag tgctttcact gtggaggagg gctaactgat 960 
tggaagccca gtgaagaccc ttgggaacaa catgctaaat ggtatccagg gtgcaaatat 1020 
ctgttagaac agaagggaca agaatatata aacaatattc atttaactca ttcacttgag 1080 
gagtgtctgg taagaactac tgagaaaaca ccatcactaa ctagaagaat tgatgatacc 1140 
atcttccaaa atcctatggt acaagaagct atacgaatgg ggttcagttt caaggacatt 1200 
aagaaaataa tggaggaaaa aattcagata tctgggagca actataaatc acttgaggtt 1260 
ctggttgcag atctagtgaa tgctcagaaa gacagtatgc aagatgagtc aagtcagact 1320 
tcattacaga aagagattag tactgaagag cagctaaggc gcctgcaaga ggagaagctt 1380 
tgcaaaatct gtatggatag aaatattgct atcgtttttg ttccttgtgg acatctagtc 1440 
acttgtaaac aatgtgctga agcagttgac aagtgtccca tgtgctacac agtcattact 1500 
ttcaagcaaa aaatttttat gtcttaatct aactctatag taggcatgtt atgttgttct 1560 
tattaccctg attgaatgtg tgatgtgaac tgactttaag taatcaggat tgaattccat 1620 
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tagcatttgc 
atctttgaat 
tttaattgaa 
attcatagta 
tcttttcaga 
atctccccaa 
gaaagataga 
tgtaaaggga 
aaagcgtatt 
gagatatgtt 
aagtatgtat 
ttaaatgtgg 
gaggggcctt 
gtagaccccg 
gtgctcttcc 
ctgagtgctg 



taccaagtag 
ttcttgattt 
accatagact 
tactgattta 
taggcttaac 
tcacataatt 
gattgttttt 
taaacacgga 
taatgataga 
aagtgtaaaa 
gttgttaata 
tttctcttcg 
ttcactttcg 
aagggtttta 
tagggagctg 
gggcactttg 



39740-000lPCT.txt 
gaaaaaaaat gtacatggca gtgttttagt 
ttcagggtat tagctgtatt atccattttt 
aagaataaga agcatcatac tataactgaa 
atttctaagt gtaagtgaat taatcatctg 
aaatggagct ttctgtatat aaatgtggag 
tgttttgtgt gaaaaaggaa taaattgttc 
agaggttgjgt tgttgtgttt taggattctg 
cgtgtgcgaa atatgtttgt aaagtgattt 
atactatcga gccaacatgt actgacatgg 
tgcaagtggc gggacactat gtatagtctg 
tgcatagaac gagagatttg gaaagatata 
gggagggggg gattggggga ggggccccag 
acttttttca ttttgttctg ttcggatttt 
tgggaactaa catcagtaac ctaacccccg 
tgttgtttcc cacccaccac ccttccctct 



tggcaatata 
tttactgtta 
cacaatgtgt 
gattttttat 
attagagtta 
catgctggtg 
tccattttct 
gccattgttg 
aaagatgtca 
agccagatca 
caccaaactg 
aggggtttta 
ttataagtat 
tgactatcct 
gaacaaatgc 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2540 



<210> 332 

<211> 1474 

<212> DNA 

<213> Homo sapiens 



<400> 332 

aaaaagaaat 

aatttaactg 

gagctgaccc 

accccggaga 

agcgaggccg 

gccgacacca 

atcggcggcg 

aaatggttca 

gtatttgtac 

ggagatggag 

aatgttacag 

cattatagac 

aatagtgaga 

aacgccggcc 

cgaccacagt 

ggtgcaggag 

tccgcagggg 

gaaaatcaag 

ttcaattacc 

gcagccgatc 

taaatgccgg 

gaaattccag 

tttgcccgtt 

ttatgtttta 

ctggtttttc 



caagaatgca 
aggatgtaaa 
cagagagccc 
gcccgcagct 
agacccagca 
agcccggcac 
ctggcgcggg 
atgtaaggaa 
accagactgc 
agactgtgga 
gtcctggtgg 
gctatccacg 
gtggggaaaa 
ctacgcaggc 
attccaaccc 
aacaaggtag 
ccctcctcgc 
gagatgagac 
gacgcagacg 
caccagctga 
cttaccatct 
caataagaaa 
gaccagataa 
cctaaagacg 
tcaatacgcc 



attttattta 
agacctctac 
tgggcagccc 
gccgcagccg 
gccgcccgcc 
taccggagcg 
cgacaagaag 
cggatatggt 
cataaagaag 
gtttgatgtt 
tgttccagtt 
tcgtaggggt 
gaacgaggga 
gaaggttccc 
tcctgtgcag 
accagtgagg 
caaaagacag 
ccaaggtcag 
cccagaaaac 
gaattcgtcc 
ctaccatcat 
tgaacaaaag 
atagaactat 
tctctttttg 
tttaaaggaa 



caatagtcac 
aaggagagtt 
cacctccgcc 
gccccagtca 
gccccccccg 
gcgcagggag 
gtcatcgcaa 
ttcatcaaca 
aataacccca 
gttgaaggag 
caaggcagta 
cctccacgca 
tcggagagtg 
accttactac 
ggagaagtga 
cagatatgta 
cctagagagg 
cagccacctc 
cctaaaccac 
gctcccgagg 
ccggtttagt 
attggagctg 
ctgcattatc 
gtaataacaa 
ttcc 



gccggaaata 
caatgcgtag 
gccggcctag 
ccatcaccgc 
ccgcccccgc 
cggtggcccg 
cgaaggtttt 
ggaatgacac 
ggaagtacct 
aaaagggtgc 
aatatgcagc 
attaccagca 
ctcccgaagc 
atgcggagac 
tggagggtgc 
tcggggatat 
acggcaatga 
aagctcggta 
aagatggcaa 
ctgagcaggg 
catccaacaa 
aagacctaaa 
tatgcagcat 
accgtgtttt 



cctagaaata 
cgggagcgga 
ttaccatcac 
aaccatgagc 
cctcagcgcc 
ggcggctcac 
gggaacagta 
caaggaagat 
tcgcagtgta 
ggaggcagca 
agaccgtaac 
aaattaccag 
caggcccaac 
ctatgggcgt 
tgacaaccag 
agaccacgat 
agaagataaa 
ccgccgcaac 
agagacaaaa 
cggggctgag 
gaagaaatat 
gtgcttgctt 
ggggttttta 
ttaaaaaagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1474 



<210> 333 

<211> 4079 

<212> DNA 

<213> Homo sapiens 



<400> 333 

ggagcggcgg 

aacgagactt 

tttcccttgc 

gcgacccttg 

tcttctcacg 

agccaggcag 

agagctggct 

gcactggagg 

acatcgacca 

gtgttcggag 

ggcagcgagt 

gccaggacct 

gcccccaagg 



gcgggcggga 
tggagaccag 
attcgctggg 
gggggcctcc 
ggctgtcgcc 
cctcttcggc 
gctggtgggt 
cctcttcgct 
gggagctgat 
ccaggggcct 
ggaccagatc 
ggccatccgg 
ccatgaacaa 



gggctggcgg 
agacgcgcct 
agctcgcgca 
gggatttgct 
ttcaatctgg 
ttctctgtgg 
gctccccagg 
tgcccgttga 
atgcaaaagg 

gggggcaaga 

ctggagacgc 
gatgagttgg 
tttgggttct 



ggcgaacgtc tgggagacgt 

ggggggacct ggggcttggg 
gggatcgtcc catggccggg 
acctttttgg ctccctgctc 
acgtgatggg tgccttgcgc 
ccctgcaccg gcagttgcag 
ccctggctct tcctgggcag 
gcctggagga gactgactgc 
aaagcaagga gaaccagtgg 
ttgttacctg tgcacaccga 
gggatatgat tggtcgctgc 
stggtgggga atggaagttc 
gccagcaggg cacagctgcc 
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ctgaaagacc 60 
gcgtgcgaga 120 
gctcggagcc 180 
gtcgaactgc 240 
aaggagggcg 300 
ccccgacccc 360 
caggcgaatc 420 
tacagagtgg 480 
ttgggagtca 540 
tatgaggcaa 600 
tttgtgctca 660 
tgtgagggac 720 
gccttctccc 780 



wo 2003/078662 



PCT/US2003/007713 



39740-000lPCT.txt 

ctgatagcca ctacctcctc tttggggccc caggaaccta taattggaag gggttgcttt 840 
ttgtgaccaa cattgatagc tcagaccccg accagctggt gtataaaact ttggaccctg 900 
ctgaccggct cccaggacca gccggagact tggccctcaa tagctactta ggcttctcta 960 
ttgactcggg gaaaggtctg gtgcgtgcag aagagctgag ctttgtggct ggagcccccc 1020 
gcgccaacca caagggtgct gtggttatcc tgcgcaagga cagcgccagt cgcctggtgc 1080 
ccgaggttat gctgtctggg gagcgcctga cctccggctt tggctactca ctggctgtgg 1140 
ctgacctcaa cagtgatggc tggccagacc tgatagtggg tgccccctac ttctttgagc 1200 
gccaagaaga gctggggggt gctgtgtatg tgtacttgaa ccaggggggt cactgggctg 1260 
ggatctcccc tctccggctc tgcggctccc ctgactccat gttcgggatc agcctggctg 1320 
tcctggggga cctcaaccaa gatggctttc cagatattgc agtgggtgcc ccctttgatg 1380 
gtgatgggaa agtcttcatc taccatggga gcagcctggg ggttgtcgcc aaaccttcac 1440 
aggtgctgga gggcgaggct gtgggcatca agagcttcgg ctactccctg tcaggcagct 1500 
tggatatgga tgggaaccaa taccctgacc tgctggtggg ctccctggct gacaccgcag 1560 
tgctcttcag ggccagaccc atcctccatg tctcccatga ggtctctatt gctccacgaa 1620 
gcatcgacct ggagcagccc aactgtgctg gcggccactc ggtctgtgtg gacctaaggg 1680 
tctgtttcag ctacattgca gtccccagca gctatagccc tactgtggcc ctggactatg 1740 
tgttagatgc ggacacagac cggaggctcc ggggccaggt tccccgtgtg acgttcctga 1800 
gccgtaacct ggaagaaccc aagcaccagg cctcgggcac cgtgtggctg aagcaccagc 1860 
atgaccgagt ctgtggagac gccatgttcc agctccagga aaatgtcaaa gacaagcttc 1920 
gggccattgt agtgaccttg tcctacagtc tccagacccc tcggctccgg cgacaggctc 1980 
ctggccaggg gctgcctcca gtggccccca tcctcaatgc ccaccagccc agcacccagc 2040 
gggcagagat ccacttcctg aagcaaggct gtggtgaaga caagatctgc cagagcaatc 2100 
tgcagctggt ccacgcccgc ttctgtaccc gggtcagcga cacggaattc caacctctgc 2160 
ccatggatgt ggatggaaca acagccctgt ttgcactgag tgggcagcca gtcattggcc 2220 
tggagctgat ggtcaccaac ctgccatcgg acccagccca gccccaggct gatggggatg 2280 
atgcccatga agcccagctc ctggtcatgc ttcctgactc actgcactac tcaggggtcc 2340 
gggccctgga ccctgcggag aagccactct gcctgtccaa tgagaatgcc tcccatgttg 2400 
agtgtgagct ggggaacccc atgaagagag gtgcccaggt caccttctac ctcatcctta 2460 
gcacctccgg gatcagcatt gagaccacgg aactggaggt agagctgctg ttggccacga 2520 
tcagtgagca ggagctgcat ccagtctctg cacgagcccg tgtcttcatt gagctgccac 2580 
tgtccattgc aggaatggcc attccccagc aactcttctt ctctggtgtg gtgaggggcg 2640 
agagagccat gcagtctgag cgggatgtgg gcagcaaggt caagtatgag gtcacggttt 2700 
ccaaccaagg ccagtcgctc agaaccctgg gctctgcctt cctcaacatc atgtggcctc 2760 
atgagattgc caatgggaag tggttgctgt acccaatgca ggttgagctg gagggcgggc 2820 
aggggcctgg gcagaaaggg ctttgctctc ccaggcccaa catcctccac ctggatgtgg 2880 
acagtaggga taggaggcgg cgggagctgg agccacctga gcagcaggag cctggtgagc 2940 
ggcaggagcc cagcatgtcc tggtggccag tgtcctctgc tgagaagaag aaaaacatca 3000 
ccctggactg cgcccggggc acggccaact gtgtggtgtt cagctgccca ctctacagct 3060 
ttgaccgcgc ggctgtgctg catgtctggg gccgtctctg gaacagcacc tttctggagg 3120 
agtactcagc tgtgaagtcc ctggaagtga ttgtccgggc caacatcaca gtgaagtcct 3180 
ccataaagaa cttgatgctc cgagatgcct ccacagtgat cccagtgatg gtatacttgg 3240 
accccatggc tgtggtggca gaaggagtgc cctggtgggt catcctcctg gctgtactgg 3300 
ctgggctgct ggtgctagca ctgctggtgc tgctcctgtg gaagatggga ttcttcaaac 3360 
gggcgaagca ccccgaggcc accgtgcccc agtaccatgc ggtgaagatt cctcgggaag 3420 
accgacagca gttcaaggag gagaagacgg gcaccatcct gaggaacaac tggggcagcc 3480 
cccggcggga gggcccggat gcacacccca tcctggctgc tgacgggcat cccgagctgg 3540 
gccccgatgg gcatccaggg ccaggcaccg cctaggttcc catgtcccag cctggcctgt 3600 
ggctgccctc catcccttcc ccagagatgg ctccttggga tgaagagggt agagtgggct 3660 
gctggtgtcg catcaagatt tggcaggatc ggcttcctca ggggcacaga cctctcccac 3720 
ccacaagaac tcctcccacc caacttcccc ttagagtgct gtgagatgag agtgggtaaa 3780 
tcagggacag ggccatgggg tagggtgaga agggcagggg tgtcctgatg caaaggtggg 3840 
gagaagggat cctaatccct tcctctccca ttcaccctgt gtaacaggac cccaaggacc 3900 
tgcctccccg gaagtgcctt aacctagagg gtcggggagg aggttgtgtc actgactcag 3960 
gctgctcctt ctctagtttc ccctctcatc tgaccttagt ttgctgccat cagtctagtg 4020 
gtttcgtggt ttcgtctatt tattaaaaaa tatttgaga.a caaaaaaaaa aaaaaaaaa 4079 

<210> 334 

<211> 3373 

<212> DNA 

<213> Homo sapiens 

<400> 334 

ggtggcaact tctcctcctg cggecgggag cggcctgcct gcctccctgc gcacccgcag 60 
cctcccccgc tgcctcccta gggctcccct ccggccgcca gcgcccattt ttcattccct 120 
agatagagat actttgcgcg cacacacata catacgcgcg caaaaaggaa aaaaaaaaaa 180 
aaaagcccac cctccagcct cgctgcaaag agaaaaccgg agcagccgca gctcgcagct 240 
cgcagctcgc agcccgcagc ccgcagagga cgcccagagc ggcgagcagg cgggcagacg 300 
gaccgacgga ctcgcgccgc gtccacctgt cggccgggcc cagccgagcg cgcagcgggc 360 
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acgccgcgcg 
tcccgccccc 
ctgccgttgc 
cccagctccg 
ccggcgggca 
ttccttttcc 
ctggttcgcg 
gtagccggcc 
ccggggccgg 
gcgcctgatc 
cgcgggggcg 
gcatgaatcg 
ccgaggggga 
cctttgatga 
tggacctgaa 
gaaggagcct 
gcaccgaggt 
tgtggccgcc 
agtgccgccc 
ggaagaagcc 
gtgagacagt 
gagccaaaac 
agggcaagca 
gagcctaggg 
tattgccccc 
gcctgattcg 
atcagcgggt 
aggactgaac 
agagagactg 
ggccacacct 
gcctgatccc 
cctcggactc 
ggtgcccaaa 
gttttcgttt 
ggtgccctct 
tgggtatccc 
cctctggttc 
agtctgctcc 
tcctctctag 
acccagccac 
gcagggagca 
tggcagatgt 
ggactggctg 
ggtaggggca 
tccagcaact 
tgcagccctc 
agctccacct 
aagggaaaag 
ctactgcact 
tgattatatt 
tatttaaagt 



cgcggagcag 
ctacccggcc 
aaagccaact 
cgctttccga 
gaggaaaacg 
tcttggaggt 
acgcagcccc 
tcggacccag 
gggcggcggc 
gccgcggacc 
gcgcgctcga 
ctgctgggcg 
ccccattccc 
tctccaacgc 
catgacccgc 
gggttccctg 
gttcgagatc 
ctgtgtggag 
cacccaggtg 
aatctttaag 
ggcagctgca 
gccccaaact 
ccggaaattc 
gcatcggcag 
atggggtcct 
gacggccaat 
ctcctcccag 
tccatcgcca 
atggggtcgc 
gagcgctgtg 
tgaacccctg 
catggctaag 
caccagtcac 
tgaagacgtg 
caggtgggtt 
ctgcctcctg 
ctttcatctc 
accaaggggc 
ggcagaagac 
gactgccatg 
ctgttcacat 
aggtaaagag 
tgcccctgat 
gggaggcaac 
gcccttccag 
ttgcccgcac 
ctggtggctc 
atccccaaga 
ttcccccttc 
tttgggggct 
tgt 



ccgtgcccgc 
cgggcgggag 
ttggaaaaag 
ttttgggggc 
cctgtagccg 
tggagtcccc 
ccggccgtgg 
gtcctgcgcc 
gccgggggca 
tgagccgagc 
tctacgcgtc 
ctcttcctgt 
gaggagcttt 
ctgctgcacg 
tcccactctg 
accattgctg 
tcccggcgcc 
gtgcagcgct 
cagctgcgac 
aaggccacgg 
cggcctgtga 
cgggtgacca 
aagcacacgc 
gagagtgtgt 
tggagtgata 
ggtgcttccc 
cggectccgg 
tcttcttccc 
tctttggggg 
gactgtcctg 
gccagctctg 
accacagacg 
ctcgtctccc 
gactcctctt 
agagatggag 
ccaccccttc 
tctacctcca 
tcttgaaccc 
tagaggccag 
taaggttgtg 
catagataaa 
gaaaaggatc 
gcagcctgtg 
actgctgtcc 
gtgggtgtgg 
ccctGcctgc 
ctcctaggaa 
ccccctgggg 
ccgccttcca 
ttccttttat 



cgcccgggcc 
tttgcacctc 
ttttttgggg 
ctttccagaa 
gcgagtgaag 
tgggcgcccc 
atgctgcact 
caggtcctcc 
tgcgggtgag 
ccacccccct 
cggggccccg 
ctctctgctg 
atgagatgct 
gagaccccgg 
gaggcgagct 
agccggccat 
tcatagaccg 
gctccggctg 
ctgtccaggt 
tgacgctgga 
cccgaagccc 
ttcggacggt 
atgacaagac 
gggcagggtt 
atattgtttc 
ccacccctcc 
tcttgcccag 
ttaactccaa 
aaacgggttc 
aggagccctg 
aggggaggca 
ggcacacaga 
tggtgcctct 
ggtgggtgtg 
tttgctgttg 
ctccccatac 
ccctgcattt 
cttattaagg 
ggcagcaagg 
cagggtgtgt 
gctgatttgt 
cttttcctaa 
gctggagtgg 
acatgacctc 
gacacctggg 
ttgcacactt 
accagctcgt 
tgggatctga 
aaacctgctt 
tttttaaatg 



cgccgccagg 
tccctgcccg 
gagacttggg 
aatgttgcaa 
acgaaccat:c 
cacacggcta 
cgggctcggg 
cctgcccccc 
ccgcggctgc 
ccccagcccc 
cggggccggg 
ctacctgcgt 
gagtgaccac 
agaggaagat 
ggagagcttg 
gatcgccgag 
caccaacgcc 
ctgcaacaac 
gagaaagatc 
agaccacctg 

ggggggttcc 

gcgagtccgc 
ggcactgaag 
atttaatatg 
cctcgtccgt 
acgtgtccgt 
cagctcaaag 
gaacttggga 
cttcccctgc 
aggacctctc 
cctccaggca 
ctggagaaaa 
gtgcacagtg 
gccagcacac 
aggtggtgta 
tccactctga 
tcctcttgtc 
ccccagatga 
gacctgctca 
actgcacaag 
atatttatta 
ttcacacaaa 
ccaaatagga 
catttcccaa 
agaaggtctc 
ccccatcttt 
gggctgggaa 
gctcccacct 
ccttcagttt 
taaaatttat 



gcgcacacgc 
ggtgctcgag 
ccttgaggtg 
aaaagctaag 
gactgccgtg 
gacgcctcgg 
atccgcccag 
agcgacggag 
agaggcctga 
ccaccctggc 
cccggagtcg 
ctggtcagcg 
tcgatccgct 
ggggccgagt 
gctcgtggaa 
tgcaagacgc 
aacttcctgg 
cgcaacgtgc 
gagattgtgc 
gcatgcaagt 
caggagcagc 
cggcccccca 
gagacccttg 
gtatttgctg 
ctgtctcgat 
ccacccttcc 
aagaaaaaga 
taagagtgtg 
acctggcctg 
agcatagcct 
ggccaggctg 
cccctcccac 
gcttcttttc 
caagtggctg 
gatggtgacc 
ttcacctctt 
ctggcccttc 
ccccagtcac 
tcatattcca 
gacattgtat 
tgacaatttc 
gactccttgt 
gggagactgt 
agtcctctgc 
caagggaggg 
gatccttctg 
tgggggagag 
cccttcccac 
gtaaagtcgg 
ttatattccg 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3373 



<210> 335 

<211> 2304 

<212> DNA 

<213> Homo sapiens 



<400> 335 

gtccccgcag 

ccccgacctc 

gagcgactcc 

tggaggaaca 

gaaattcgga 

tcacttttac 

ctctgccact 

cctggggaaa 

gcaggtgggc 

aaagccctcc 



cgccgtcgcg 
gccaccatga 
aaaggcagca 
tgtgtgtcca 
gggcagcact 
cgaggaaagg 
gtccttcagc 
cataattact 
ctaaagccgc 
tctcctccag 



ccctcctgcc 
gagccctgct 
atgaacttca 
acaagtactt 
gtgaaataga 
ccagcactga 
aaacgtacca 
gcaggaaccc 
ttgtccaaga 
aagaattaaa 



gcaggccacc gaggccgccg 
ggcgcgcctg cttctctgcg 
tcaagttcca tcgaactgtg 
ctccaaeatt cactggtgca 
taagtcaaaa acctgctatg 
caccatgggc cggccctgcc 
tgcccacaga tctgatgctc 
agacaaccgg aggcgaccct 
gtgcatggtg catgactgcg 
atttcagtgt ggccaaaaga 
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ccgtctagcg 60 
tcctggtcgt 120 
actgtctaaa 180 
actgcccaaa 240 
aggggaatgg 300 
tgccctggaa 360 
ttcagctggg 420 
ggtgctatgt 480 
caganggaaa 540 
ctctgaggcc 600 
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ccgctttaag attattgggg gagaattcac caccatcgag aaccagccct ggtttgcggc 660 

catctacagg aggcaccggg ggggctctgt cacctacgtg tgtggaggca gcctcatcag 720 

cccttgctgg gtgatcagcg ccacacactg cttcattgat tacccaaaga aggaggacta 780 

catcgtctac ctgggtcgct caaggcttaa ctccaacacg caaggggaga tgaagtttga 840 

ggtggaaaac ctcatcctac acaaggacta cagcgctgac acgcttgctc accacaacga 900 

cattgccttg ctgaagatcc gttccaagga gggcaggtgt gcgcagccat cccggactat 960 

acagaccatc tgcctgccct cgatgtataa cgatccccag tttggcacaa gctgtgagat 1020 

cactggcttt ggaaaagaga attctaccga ctatctctat ccggagcagc tgaaaatgac 1080 

tgttgtgaag ctgatttccc accgggagtg tcagcagccc cactactacg gctctgaagt 1140 

caccaccaaa atgctatgtg ctgctgaccc ccaatggaaa acagattcct gccagggaga 1200 

ctcaggggga cccctcgtct gttccctcca aggccgcatg actttgactg gaattgtgag 1260 

ctggggccgt ggatgtgccc tgaaggacaa gccaggcgtc tacacgagag tctcacactt 1320 

cttaccctgg atccgcagtc acaccaagga agagaatggc ctggccctct gagggtcccc 1380 

agggaggaaa cgggcaccac ccgctttctt gctggttgtc atttttgcag tagagtcatc 1440 

tccatcagct gtaagaagag actgggaaga taggctctgc acagatggat ttgcctgtgg 1500 

caccaccagg gtgaacgaca atagctttac cctcacggat aggcctgggt gctggctgcc 1560 

cagaccctct ggccaggatg gaggggtggt cctgactcaa catgttactg accagcaact 1620 

tgtctttttc tggactgaag cctgcaggag ttaaaaaggg cagggcatct cctgtgcatg 1680 

ggctcgaagg gagagccagc tcccccgacc ggtgggcatt tgtgaggccc atggttgaga 1740 

aatgaataat ttcccaatta ggaagtgtaa gcagctgagg tctcttgagg gagcttagcc 1800 

aatgtgggag cagcggtttg gggagcagag acactaacga cttcagggca gggctctgat 1860 

attccatgaa tgtatcagga aatatatatg tgtgtgtatg tttgcacact tgttgtgtgg 1920 

gctgtgagtg taagtgtgag taagagctgg tgtctgattg ttaagtctaa atatttcctt 1980 

aaactgtgtg gactgtgatg ccacacagag tggtctttct ggagaggtta taggtcactc 2040 

ctggggcctc ttgggtcccc cacgtgacag tgcctgggaa tgtacttatt ctgcagcatg 2100 

, acctgtgacc agcactgtct cagtttcact ttcacataga tgtccctttc ttggccagtt 2160 

atcccttcct tttagcctag ttcatccaat cctcactggg tggggtgagg accactcctt 2220 

acactgaata tttatatttc actattttta tttatatttt tgtaatttta aataaaagtg 2280 

atcaataaaa tgtgattttt ctga 2304 

<210> 336 

<211> 1876 

<212> DNA 

<213> Homo sapiens 

<400> 336 

cgcggccgcg gttcgctgtg gcgggcgcct gggccgccgg ctgtttaact tcgcttccgc 60 
tggcccatag tgatctttgc agtgacccag cagcatcact gtttcttggc gtgtgaagat 120 
aacccaagga attgaggaag ttgctgagaa gagtgtgctg gagatgctct aggaaaaaat 180 
tgaatagtga gacgagttcc agcgcaaggg tttctggttt gccaagaaga aagtgaacat 240 
catggatcag aacaacagcc tgccacctta cgctcagggc ttggcctccc ctcagggtgc 300 
catgactccc ggaatcccta tctttagtcc aatgatgcct tatggcactg gactgacccc 360 
acagcctatt cagaacacca atagtctgtc tattttggaa gagcaacaaa ggcagcagca 420 
gcaacaacaa cagcagcagc agcagcagca gcagcagcaa cagcaacagc agcagcagca 480 
gcagcagcag cagcagcagc agcagcagca gcagcagcag caacaggcag tggcagctgc 540 
agccgttcag cagtcaacgt cccagcaggc aacacaggga acctcaggcc aggcaccaca 600 
gctcttccac tcacagactc tcacaactgc acccttgccg ggcaccactc cactgtatcc 660 
ctcccccatg actcccatga cccccatcac tcctgccacg ccagcttcgg agagttctgg 720 
gattgtaccg cagctgcaaa atattgtatc cacagtgaat cttggttgta aacttgacct 780 
aaagaccatt gcacttcgtg cccgaaacgc cgaatataat cccaagcggt ttgctgcggt 840 
aatcatgagg ataagagagc cacgaaccac ggcactgatt ttcagttctg ggaaaatggt 900 
gtgcacagga gccaagagtg aagaacagtc cagactggca gcaagaaaat atgctagagt 960 
tgtacagaag ttgggttttc cagctaagtt cttggacttc aagattcaga acatggtggg 1020 
gagctgtgat gtgaagtttc ctataaggtt agaaggcctt gtgctcaccc accaacaatt 1080 
tagtagttat gagccagagt tatttcctgg tttaatctac agaatgatca aacccagaat 1140 
tgttctcctt atttttgttt ctggaaaagt tgtattaaca ggtgctaaag tcagagcaga 1200 
aatttatgaa gcatttgaaa acatctaccc tattctaaag ggattcagga agacgacgta 1260 
atggctctca tgtacccttg cctcccccac ccccttcttt tttttttttt aaacaaatca 1320 
gtttgttttg gtacctttaa atggtggtgt tgtgagaaga tggatgttga gttgcagggt 1380 
gtggcaccag gtgatgccct tctgtaagtg cccaccgcgg gatgccggga aggggcatta 1440 
tttgtgcact gagaacaccg cgcagcgtga ctgtgagttg ctcataccgt gctgctatct 1500 
gggcagcgct gcccatttat ttatatgtag attttaaaca ctgctgttga caagttggtt 1560 
tgagggagaa aactttaagt gttaaagcca cctctataat tgattggact ttttaatttt 1620 
aatgtttttc cccatgaacc acagttttta tatttctacc agaaaagtaa aaatcttttt 1680 
taaaagtgtt gtttttctaa tttataactc ctaggggtta tttctgtgcc agacacattc 1740 
cacctctcca gtattgcagg acggaatata tgtgttaatg aaaatgaatg gctgtacata 1800 
tttttttctt tcttcagagt actctgtaca ataaatgcag tttataaaag tgttaaaaaa 1860 
aaaaaaaaaa aaaaaa 1876 
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<210> 337 

<211> 6633 

<212> DNA 

<213> Homo sapiens 



<400> 337 

ttctccccgc 

gtaagagcaa 

gagagaaact 

tttacttaga 

aacgcggttt 

aagagatcat 

cggaataaca 

ggtcttaggc 

tatccttcca 

ctttggggag 

ggaaatcaga 

tgcctcggcg 

gaatgagctt 

acctctagga 

ttgtcgcaca 

tgcctcctac 

tgaggccacc 

agcaacatca 

aacgattgtg 

ccctggagaa 

caaattggtg 

atgtgctgcc 

ccatgagaaa 

gcatgaagtc 

gctgaaaaac 

gattcaggaa 

tggccattat 

gttaactcaa 

gggacagacg 

atgcaaagat 

ctcaaacatc 

tttcgccaaa 

tgagaaccga 

tgcagtcctg 

gaaacagaaa 

acatgaatat 

aagagatgga 

aggaacagcc 

aaaacccacg 

tcacctgggg 

catttacatc 

tagggatagc 

attgaaccct 

tgactacatg 

agaggtttct 

gaagaaatct 

ccttacttta 

ggcttcaaaa 

aaaaattgtg 

tgtgtcgaaa 

caacctctac 

ttcccttggt 

caagagtggg 

ggtgaaatgc 

tgtggagaat 

cctgaagagt 

tggtgtcacc 

gcagagactg 

tgaggaggag 

cattgagacg 

catcgacatg 



cccccagttg 
aaagcgaagg 
tttattttga 
gcaaatgatt 
ttgagcccat 
ttgggaaggc 
tcggaggaga 
tgtcttctca 
aatgaaaatg 
agtgaagtga 
aatgaagaaa 
gcccacacag 
gaaggcaggc 
atgacggatt 
actgatcccg 
gacagcagac 
gtcaaaggaa 
gagctggatc 
gtcacctgtg 
gtgaaaggca 
tacactttga 
cgccaggcta 
ggtttcattg 
aaacattttg 
aatctgactc 
ataaggtatc 
actattgtag 
gttccttcat 
gtgaggtgca 
attaagaaat 
atcacggaga 
gtggaggaga 
gagctgaagc 
gtgctgttgg 
ccgaggtatg 
atttatgtgg 
ctagtgcttg 
tatggattaa 
gccagatcca 
ccacatttga 
atcacagagt 
ttcctgagcc 
gctgatgaaa 
gacatgaagc 
aaatattccg 
atgttagact 
ttggatttgt 
aattgtgtcc 
aagatctgtg 
ggcagtacct 
accacactga 
ggcacccctt 
taccggatgg 
tggaacagtg 
ctgctgcctg 
gaccatcctg 
tacaaaaacg 
agcgctgaca 
gacctgggca 
ggttccagca 
atggacgaca 



ttgtcgaagt ctgggggttg ggactggacc ccctgattgc 60 
cgcaatctgg acactgggag attcggagcg cagggagttt 120 
agagaccaag gttgaggggg ggcttatttc ctgacagcta 180 
agttttagaa ggatggacta taacattgaa tcaattacaa 240 
tactgttgga gctacaggga gagaaacagg aggagactgc 300 
cgtgggcacg ctctttactc catgtgtggg acattcattg 360 
agtttcccag agctatgggg acttcccatc cggcgttcct 420 
cagggctgag cctaatcctc tgccagcttt cattaccctc 480 
aaaaggttgt gcagctgaat tcatcctttt ctctgagatg 540 
gctggcagta ccccatgtct gaagaagaga gctccgatgt 600 
acaacagcgg cctttttgtg acggtcttgg aagtgagcag 660 
ggttgtacac ttgctattac aaccacactc agacagaaga 720 
acatttacat ctatgtgcca gacccagatg tagcctttgt 780 
atttagtcat cgtggaggat gatgattctg ccattatacc 840 
agactcctgt aaccttacac aacagtgagg gggtggtacc 900 
agggctttaa tgggaccttc actgtagggc cctatatctg 960 
agaagttcca gaccatccca tttaatgttt atgctttaaa 1020 
tagaaatgga agctcttaaa accgtgtata agtcagggga 1080 
ctgtttttaa caatgaggtg gttgaccttc aatggactta 1140 
aaggcatcac aatgctggaa gaaatcaaag tcccatccat 1200 
cggtccccga ggccacggtg aaagacagtg gagattacga 1260 
ccagggaggt caaagaaatg aagaaagtca ctatttctgt 1320 
aaatcaaacc caccttcagc cagttggaag ctgtcaacct 1380 
ttgtagaggt gcgggcctac ccacctccca ggatatcctg 1440 
tgattgaaaa tctcactgag atcaccactg atgtggaaaa 1500 
gaagcaaatt aaagctgatc cgtgctaagg aagaagacag 1560 
ctcaaaatga agatgctgtg aagagctata cttttgaact 1620 
ccattctgga cttggtcgat gatcaccatg gctcaactgg 1680 
cagctgaagg cacgccgctt cctgatattg agtggatgat 1740 
gtaataatga aacttcctgg actattttgg ccaacaatgt 1800 
tccactcccg agacaggagt accgtggagg gccgtgtgac 1860 
ccatcgccgt gcgatgcctg gctaagaatc tccttggagc 1920 
tggtggctcc caccctgcgt tctgaactca cggtggctgc 1980 
tgattgtgat catctcactt attgtcctgg ttgtcatttg 2040 
aaattcgctg gagggtcatt gaatcaatca gcccggatgg 2100 
acccgatgca gctgccttat gactcaagat gggagtttcc 2160 
gtcgggtctt ggggtctgga gcgtttggga aggtggttga 2220 
gccggtccca acctgtcatg aaagttgcag tgaagatgct 2280 
gtgaaaaaca agctctcatg tctgaactga agataatgac 2340 
acattgtaaa cttgctggga gcctgcacca agtcaggccc 2400 
attgcttcta tggagatttg gtcaactatt tgcataagaa 2460 
accacccaga gaagccaaag aaagagctgg atatctttgg 2520 
gcacacggag ctatgttatt ttatcttttg aaaacaatgg 2580 
aggctgatac tacacagtat gtccccatgc tagaaaggaa 2640 
acatccagag atcactctat gatcgtccag cctcatataa 2700 
cagaagtcaa aaacctcctt tcagatgata aqtcagaagg 2760 
tgagcttcac ctatcaagtt gcccgaggaa tggagttttt 2820 
accgtgatct ggctgctcgc aacgtcctcc tggcacaagg 2880 
actttggcct ggccagagac atcatgcatg attcgaacta 2940 
ttctgcccgt gaagtggatg gctcctgaga gcatctttga 3000 
gtgatgtctg gtcttatggc attctgctct gggagatctt 3060 
accccggcat gatggtggat tctactttct acaataagat 3120 
ccaagcctga ccacgctacc agtgaagtct acgagatcat 3180 
agccggagaa gagaccctcc ttttaccacc tgagtgagat 3240 
gacaatataa aaagagttat gaaaaaattc acctggactt 3300 
ctgtggcacg catgcgtgtg gactcagaca atgcatacat 3360 
aggaagacaa gctgaaggac tgggagggtg gtctggatga 3420 
gtggctacat cattcctctg cctgacattg accctgtccc 3480 
agaggaacag acacagctcg cagacctctg aagagagtgc 3540 
gttccacctt catcaagaga gaggacgaga ccattgaaga 3600 
tcggcataga ctcttcagac cnggtggaag acagcttcct 3660 
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gtaactggcg gattcgaggg gttccttcca cttctggggc cacctctgga tcccgttcag 3720 
aaaaccactt tattgcaatg cggaggttga gaggaggact tggttgatgt ttaaagagaa 3780 
gttcccagcc aagggcctcg gggagcgttc taaatatgaa tgaatgggat attttgaaat 3840 
gaactttgtc agtgttgcct ctcgcaatgc ctcagtagca tctcagtggt gtgtgaagtt 3900 
tggagataga tggataaggg aataataggc cacagaaggt gaactttgtg cttcaaggac 3960 
attggtgaga gtccaacaga cacaatttat actgcgacag aacttcagca ttgtaattat 4020 
gtaaataact ctaaccaagg ctgtgtttag attgtattaa ctatcttctt tggacttctg 4080 
aagagaccac tcaatccatc catgtacttc cctcttgaaa cetgatgtca gctgctgttg 4140 
aactttttaa agaagtgcat gaaaaaccat ttttgaacct taaaaggtac tggtactata 4200 
gcattttgct atctttttta gtgttaagag ataaagaata ataattaacc aaccttgttt 4260 
aatagatttg ggtcatttag aagcctgaca actcattttc atattgtaat ctatgtttat 4320 
aatactacta ctgttatcag taatgctaaa tgtgtaataa tgtaacatga tttccctcca 4380 
gagaaagcac aatttaaaac aatccttact aagtaggtga tgagtttgac agtttttgac 4440 
atttatatta aataacatgt ttctctataa agtatggtaa tagctttagt gaattaaatt 4500 
tagttgagca tagagaacaa agtaaaagta gtgttgtcca ggaagtcaga atttttaact 4560 
gtactgaata ggttccccaa tccatcgtat taaaaaacaa ttaactgccc tctgaaataa 4620 
tgggattaga aacaaacaaa actcttaagt cctaaaagtt ctcaatgtag aggcataaac 4680 
ctgtgctgaa cataacttct catgtatatt acccaatgga aaatataatg atcagcaaaa 4740 
agactggatt tgcagaagtt tttttttttt ttcttcatgc ctgatgaaag ctttggcaac 4800 
cccaatatat gtattttttg aatctatgaa cctgaaaagg gtcagaagga tgcccagaca 4860 
tcagcctcct tctttcaccc cttaccccaa agagaaagag tttgaaactc gagaccataa 4920 
agatattctt tagtggaggc tggatgtgca ttagcctgga tcctcagttc tcaaatgtgt 4980 
gtggcagcca ggatgactag atcctgggtt tccatccttg agattctgaa gtatgaagtc 5040 
tgagggaaac cagagtctgt atttttctaa actccctggc tgttctgatc ggccagtttt 5100 
cggaaacact gacttaggtt tcaggaagtt gccatgggaa acaaataatt tgaactttgg 5160 
aacagggttg gaattcaacc acgcaggaag cctactattt aaatccttgg cttcaggtta 5220 
gtgacattta atgccatcta gctagcaatt gcgaccttaa tttaactttc cagtcttagc 5280 
tgaggctgag aaagctaaag tttggttttg acaggttttc caaaagtaaa gatgctactt 5340 
cccactgtat gggggagatt gaactttccc cgtctcccgt cttctgcctc ccactccata 5400 
ccccgccaag gaaaggcatg tacaaaaatt atgcaattca gtgttccaag tctctgtgta 5460 
accagctcag tgttttggtg gaaaaaacat tttaagtttt actgataatt tgaggttaga 5520 
tgggaggatg aattgtcaca tctatccaca ctgtcaaaca ggttggtgtg ggttcattgg 5580 
cattctttgc aatactgctt aattgctgat accatatgaa tgaaacatgg gctgtgatta 5640 
ctgcaatcac tgtgctatcg gcagatgatg ctttggaaga tgcagaagca ataataaagt 5700 
acttgactac ctactggtgt aatctcaatg caagccccaa ctttcttatc caactttttc 5760 
atagtaagtg cgaagactga gccagattgg ccaattaaaa acgaaaacct gactaggttc 5820 
tgtagagcca attagacttg aaatacgttt gtgtttctag aatcacagct caagcattct 5880 
gtttatcgct cactctccct tgtacagcct tattttgttg gtgctttgca ttttgatatt 5940 
gctgtgagcc ttgcatgaca tcatgaggcc ggatgaaact tctcagtcca gcagtttcca 6000 
gtcctaacaa atgctcccac ctgaatttgt atatgactgc atttgtgggt gtgtgtgtgt 6060 
tttcagcaaa ttccagattt gtttcctttt ggcctcctgc aaagtctcca gaagaaaatt 6120 
tgccaatctt tcctactttc tatttttatg atgacaatca aagccggcct gagaaacact 6180 
atttgtgact ttttaaacga ttagtgatgt ccttaaaatg tggtctgcca atctgtacaa 6240 
aatggtccta tttttgtgaa gagggacata agataaaatg atgttataca tcaatatgta 6300 
tatatgtatt tctatataga cttggagaat actgccaaaa catttatgac aagctgtatc 6360 
actgccttcg tttatatttt tttaactgtg ataatcccca caggcacatt aactgttgca 6420 
cttttgaatg tccaaaattt atattttaga aataataaaa agaaagatac ttacatgttc 6480 
ccaaaacaat ggtgtggtga atgtgtgaga aaaactaact tgatagggtc taccaataca 6540 
aaatgtatta cgaatgcccc tgttcatgtt tttgttttaa aacgtgtaaa tgaagatctt 6600 
tatatttcaa taaatgatat ataatttaaa gtt 6633 



<210> 338 

<211> 994 

<212> DNA = 

<213> Homo sapiens 



<400> 338 

tgctggccag 

aagcgcatga 

cagtgggagc 

gccagggtcc 

tggcggcagg 

atccagaaga 

tgcagctcag 

ccatttgaag 

gattccggca 



cacctcgagg 
gccgcagctc 
ggcccagcgg 
gctgctcgca 
agaagatcac 
tcaagtcggg 
ccaaggccag 
acgcctcgtt 
tccacatcat 



gaagatggcg 
aggccgagtg 
caacagcagc 
cctgctggtg 
ccggaccaag 
agaggaggac 
gggagacctg 
tgcgctgcgg 
cctccgcact 



gacgaggaga agctgccgcc 
tactacttca accacatcac 
agtggtggca aaaacgggca 
aagcacagcc agtcacggcg 
gaggaggccc tggagctgat 
tttgagtctc tggcctcaca 
ggtgccttca gcagaggtca 
acgggggaga tgagcgggcc 
gagtgagggt ggggagccca 
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cggctgggag 60 
taacgccagc 120 
gggggagcct 180 
gccctcgtcc 240 
caacggctac 300 
gttcagcgac 360 
gatgcagaag 420 
cgtgttcacg 480 
ggcctggcct 540 
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cggggcaggg cagggcggct aggccggcca gctccccctt gcccgccagc cagtggccga 600 

accccccact ccctgccacc gtcacacagt atttattgtt cccacaatgg ctgggagggg 660 

gcccttccag attgggggcc ctggggtccc cactccctgt ccatccccag ttggggctgc 720 

gaccgccaga ttctccctta aggaattgac ttcagcaggg gtgggaggct cccagaccca 780 

gggcagtgtg gtgggagggg tgttccaaag agaaggcctg gtcagcagag ccgccccgtg 840 

tccccccagg tgctggaggc agactcgagg gccgaattgt ttctagttag gccacgctcc 900 

tctgttcagt cgcaaaggtg aacactcatg cggcagccat gggccctctg agcaactgtg 960 

cagacccttt cacccccaat taaacccaga acca 994 

<210> 339 
<211>. 772 
<212> DNA 

<213> Homo sapiens 
<400> 339 

agctcgtgcc gaattcggca cgagccgggt cggagccatg gcggtggcaa attcaagtcc 60 
tgttaacccc gtggtgttct ttgatgtcag tattggcggt caggaagttg gccgcatgaa 120 
gatcgagctc tttgcagacg ttgtgcctaa gacggccgag aactttaggc agttctgcac 180 
cggagaa.ttc aggaaagatg gggttccaat aggatacaaa ggaagcacct tccacagggt 240 
cataaaggat ttcatgattc agggtggaga ttttgttaat ggagatggta ctggagtcgc 300 
cagtatttac cgggggccat ttgcagatga aaattttaaa cttagacact cagctccagg 360 
cctgctttcc atggcgaaca gtggtccaag tacaaatggc tgtcagttct ttatcacctg 420 
ctctaagtgc gattggctgg atgggaagca tgtggtgttt ggaaaaatca tcgatggact 480 
tctagtgatg agaaagattg agaatgttcc cacaggcccc aacaataagc ccaagctacc 540 
tgtggtgatc tcgcagtgtg gggagatgta gtccagacaa agactgaatc aggccttccc 600 
ttcttcttgg tggtgttctt gagtaagata atctggactg gcccccgtct ttgcttccct 660 
gcctgctgct gccccatttg atcaagagac catggaagtg tcagagattc agaatccaag 720 
attgtcttta agttttcaac tgtaaataaa gtttttttgt atgcgtaaaa aa 772 

<210> 340 
<211> 919 
<212> DNA 

<213> Homo sapi ens 
<400> 340 

cgctcgcctc cctcgctcca cgcgcgcccg gacgcggcgg ccaggcttgc gcgtggttcc 60 
cctcccggtg ggcggattcc tgggcaagat gaagtgggtg tgggcgctct tgctgttggc 120 
ggcgtgggca gcggccgagc gcgactgccg agtgagcagc ttccgagtca aggagaactt 180 
cgacaaggct cgcttctctg ggacctggta cgccatggcc aagaaggacc ccgagggcct 240 
ctttctgcag gacaacatcg tcgcggagtt ctcggtggac gagaccggcc agatgagcgc 300 
cacagccaag ggccgagtcc gtcttttgaa taactgggac gtgtgcgcag acatggtggg 360 
caccttcaca gacaccgagg accctgccaa gttcaagatg aagtactggg gcgtagcctc 420 
ctttctgcag aaaggaaatg atgaccactg gatcgtcgac acagactacg acacgtatgc 480 
cgtacagtac tcctgccgcc tcctgaacct cgatggcacc tgtgctgaca gctactcctt 540 
cgtgttttcc cgggacccca acggcctgcc cccagaagcg cagaagattg taaggcagcg 600 
gcaggaggag ctgtgcctgg ccaggcagta caggctgatc gtccacaacg gttactgcga 660 
tggcagatca gaaagaaacc ttttgtagca atatcaagaa tctagtttca tctgagaact 720 
tctgattagc tctcagtctt cagctctatt tatcttagga gtttaatttg cccttctctc 780 
cccatcttcc ctcagttccc ataaaacctt cattacacat aaagatacac gtgggggtca 840 
gtgaatctgc ttgcctttcc tgaaagtttc tggggcttaa gattccagac tctgattcat 900 
taaactatag tcacccgtg 919 

<210> 341 
<211> 7365 
<212> DNA 

<213> Homo sapiens 
<400> 341 

ggcagtttgt aggtcgcgag ggaagcgctg aggatcagga agggggcact gagtgtccgt 60 
ggggQSLcLtcc tcgtgatagg aactggaata tgccttgagg gggacactat gtctttaaaa 120 
acgtcggctg gtcatgaggt caggagttcc agaccagcct gaccaacgtg gtgaaactcc 180 
gtctctacta aaaatacaaa aattagccgg gcgtggtgcc gctccagcta ctcaggaggc 240 
tgaggcagga gaatcgctag aacccgggag gcggaggttg cagtgagccg agatcgcgcc 300 
attgcactcc agcctgggcg acagagcgag actgtctcaa aacaaaacaa aacaaaacaa 360 
aacaaaaaac accggctgtt cattggaaca gaaagaaatg gatttatctg ctcttcgcgt 420 
tgaagaagta caaaatgtca ttaatgctat gcagaaaatc ttagagtgtc ccatctgtct 480 
ggagttgatc aaggaacctg tctccacaaa gtgtgaccac atattttgca aattttgcat 540 
gctgaaactt ctcaaccaga agaaagggcc ttcacagtgt cctttatgta agaatgatat 600 
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aaccaaaagg agcctacaag aaagtacgag 
aatcatttgt gcttttcagc ttgacacagg 
aaaaaaggaa aataactctc ctgaacatct 
gggctacaga aaccgtgcca aaagacttct 
ggaaaccagt ctcagtgtcc aactctctaa 
gcagcggata caacctcaaa agacgtctgt 
agataccgtt aataaggcaa cttattgcag 
ccctcaagga accagggatg aaatcagttt 
ttctgagacg gatgtaacaa atactgaaca 
cactgagaag cgtgcagctg agaggcatcc 
cttgcatgtg gagccatgtg gcacaaatac 
cagtttatta ctcactaaag acagaatgaa 
caaacagcct ggcttagcaa ggagccaaca 
taatgatagg cggactccca gcacagaaaa 
tgagagaaaa gaatggaata agcagaaact 
agatgttcct tggataacac taaatagcag 
aagtgatgaa ctgttaggtt ctgatgactc 
agtagctgat gtattggacg ttctaaatga 
aatagactta ctggccagtg atcctcatga 
ctccaaatca gtagagagta atattgaaga 
ggcaagcctc cccaacttaa gccatgtaac 
tgagccacag ataatacaag agcgtcccct 
tacatcaggc cttcatcctg aggattttat 
tcctgaaatg ataaatcagg gaactaacca 
tactaatagt ggtcatgaga ataaaacaaa 
taacccaata gaatcactcg aaaaagaatc 
cagcagtata agcaatatgg aactcgaatt 
gaataggctg aggaggaagt cttctaccag 
tagaaatcta agcccaccta attgtactga 
agagataaag aaaaaaaagt acaaccaaat 
catggaaggt aaagaacctg caactggagc 
aagtaaaaga catgacagcg atactttccc 
ttttactaag tgttcaaata ccagtgaact 
agaagaaaaa gaagagaaac tagaaacagt 
agatctcatg ttaagtggag aaagggtttt 
tatttcattg gtacctggta ctgattatgg 
tagcactcta gggaaggcaa aaacagaacc 
tgaaaacccc aagggactaa ttcatggttg 
ctttaagtat ccattgggac atgaagttaa 
agaaagtgaa cttgatgctc agtatttgca 
atttgctccg ttttcaaatc caggaaatgc 
ctctgggtcc ttaaagaaac aaagtccaaa 
aaatcaagga aagaatgagt ctaatatcaa 
ctttcctgtg gttggtcaga aagataagcc 
aggctctagg ttttgtctat catctcagtt 
aaataaacat ggacttttac aaaacccata 
atttgttaaa actaaatgta agaaaaatct 
gtcacctgaa agagaaatgg gaaatgagaa 
taataacatt agagaaaatg tttttaaaga 
ttccagtact aatgaagtgg gctccagtat 
tcaagcagaa ctaggtagaa acagagggcc 
tttgcaacct gaggtctata aacaaagtct 
aaaaaagcaa gaatatgaag aagtagttca 
gatttcagat aacttagaac agcctatggg 
gacacctgat gacctgttag atgatggtga 
tgacattaag gaaagttctg ctgtttttag 
gagtcctagc cctttcaccc atacacattt 
attagagtcc tcagaagaga acttatctag 
cttgttattt ggtaaagtaa acaatatacc 
taccgagtgt ctgtctaaga acacagagga 
tgactgcagt aaccaggtaa tattggcaaa 
aacaaaatgt tctgctagct tgttttcttc 
aaatacaaac acccaggatc ctttcttgat 
tgaaagccag ggagttggtc tgagtgacaa 
aacgggcttg gaagaaaata atcaagaaga 
agcatctggg tgtgagagtg aaacaagcgt 
gagtgacatt ttaaccactc agcagaggga 
gcaggaaatg gctgaactag aagctgtgtt 



59740~OOOlPCT.txt 

atttagtcaa cttgttgaag agctattgaa 660 
tttggagtat gcaaacagct ataattttgc 720 
aaaagatgaa gtttctatca tccaaagtat 780 
acagagtgaa cccgaaaatc cttccttgca 840 
ccttggaact gtgagaactc tgaggacaaa 900 
ctacattgaa ttgggatctg attcttctga 960 
tgtgggagat caagaattgt tacaaatcac 1020 
ggattctgca aaaaaggctg cttgtgaatt 1080 
tcatcaaccc agtaataatg atttgaacac 1140 
agaaaagtat cagggtagtt ctgtttcaaa 1200 
tcatgccagc tcattacagc atgagaacag 1260 
tgtagaaaag gctgaattct gtaataaaag 1320 
taacagatgg gctggaagta aggaaacatg 1380 
aaaggtagat ctgaatgctg atcccctgtg 1440 
gccatgctca gagaatccta gagatactga 1500 
cattcagaaa gttaatgagt ggttttccag 1560 
acatgatggg gagtctgaat caaatgccaa 1620 
ggtagatgaa tattctggtt cttcagagaa 1680 
ggctttaata tgtaaaagtg aaagagttca 1740 
caaaatattt gggaaaacct atcggaagaa 1800 
tgaaaatcta attataggag catttgttac 1860 
cacaaataaa ttaaagcgta aaaggagacc 1920 
caagaaagca gatttggcag ttcaaaagac 1980 
aacggagcag aatggtcaag tgatgaatat 2040 
aggtgattct attcagaatg agaaaaatcc 2100 
tgctttcaaa acgaaagctg aacctataag 2160 
aaatatccac aattcaaaag cacctaaaaa 2220 
gcatattcat gcgcttgaac tagtagtcag 2280 
attgcaaatt gatagttgtt ctagcagtga 2340 
gccagtcagg cacagcagaa acctacaact 2400 
caagaagagt aacaagccaa atgaacagac 2460 
agagctgaag ttaacaaatg cacctggttc 2520 
taaagaattt gtcaatccta gccttccaag 2580 
taaagtgtct aataatgctg aagaccccaa 2640 
gcaaactgaa agatctgtag agagtagcag 2700 
cactcaggaa agtatctcgt tactggaagt 2760 
aaataaatgt gtgagtcagt gtgcagcatt 2820 
ttccaaagat aatagaaatg acacagaagg 2880 
ccacagtcgg gaaacaagca tagaaatgga 2940 
gaatacattc aaggtttcaa agcgccagtc 3000 
agaagaggaa tgtgcaacat tctctgccca 3060 
agtcactttt gaatgtgaac aaaaggaaga 3120 
gcctgtacag acagttaata tcactgcagg 3180 
agttgataat gccaaatgta gtatcaaagg 3240 
cagaggcaac gaaactggac tcattactcc 3300 
tcgtatacca ccactttttc ccatcaagtc 3360 
gctagaggaa aactttgagg aacattcaat 3420 
cattccaagt acagtgagca caattagccg 3480 
agccagctca agcaatatta atgaagtagg 3540 
taatgaaata ggttccagtg atgaaaacat 3600 
aaaattgaat gctatgctta gattaggggt 3660 
tcctggaagt aattgtaagc atcctgaaat 3720 
gactgttaat acagatttct ctccatatct 3780 
aagtagtcat gcatctcagg tttgttctga 3840 
aataaaggaa gatactagtt ttgctgaaaa 3900 
caaaagcgtc cagaaaggag agcttagcag 3960 
ggctcagggt taccgaagag gggccaagaa 4020 
tgaggatgaa gagcttccct gcttccaaca 4080 
ttctcagtct actaggcata gcaccgttgc 4140 
gaatttatta tcattgaaga atagcttaaa 4200 
ggcatctcag gaacatcacc ttagtgagga 4260 
acagtgcagt gaattggaag acttgactgc 4320 
tggttcttcc aaacaaatga ggcatcagtc 4380 
ggaattggtt tcagatgatg aagaaagagg 4440 
gcaaagcatg gattcaaact taggtgaagc 4500 
ctctgaagac tgctcagggc tatcctctca 4560 
taccatgcaa cataacctga taaagctcca 4620 
agaacagcat gggagccagc cttctaacag 4680 
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ctacccttcc atcataagtg actcttctgc ccttgaggac ctgcgaaatc cagaacaaag 4740 
cacatcagaa aaagcagtat taacttcaca gaaaagtagt gaatacccta taagccagaa 4800 
tccagaaggc ctttctgctg acaagtttga ggtgtctgca gatagttcta ccagtaaaaa 4860 
taaagaacca ggagtggaaa ggtcatcccc ttctaaatgc ccatcattag atgataggtg 4920 
gtacatgcac agttgctctg ggagtcttca gaatagaaac tacccatctc aagaggagct 4980 
cattaaggtt gttgatgtgg aggagcaaca gctggaagag tctgggccac acgatttgac 5040 
ggaaacatct tacttgccaa ggcaagatct agagggaacc ccttacctgg aatctggaat 5100 
cagcctcttc tctgatgacc ctgaatctga tccttctgaa gacagagccc cagagtcagc 5160 
tcgtgttggc aacataccat cttcaacctc tgcattgaaa gttccccaat tgaaagttgc 5220 
agaatctgcc cagagtccag ctgctgctca tactactgat actgctgggt ataatgcaat 5280 
ggaagaaagt gtgagcaggg agaagccaga attgacagct tcaacagaaa gggtcaacaa 5340 
aagaatgtcc atggtggtgt ctggcctgac cccagaagaa tttatgctcg tgtacaagtt 5400 
tgccagaaaa caccacatca ctttaactaa tctaattact gaagagacta ctcatgttgt 5460 
tatgaaaaca gatgctgagt ttgtgtgtga acggacactg aaatattttc taggaattgc 5520 
Oggaggaaaa tgggtagtta gctatttctg ggtgacccag tctattaaag aaagaaaaat 5580 
gctgaatgag catgattttg aagtcagagg agatgtggtc aatggaagaa accaccaagg 5640 
tccaaagcga gcaagagaat cccaggacag aaagatcttc agggggctag aaatctgttg 5700 
ctatgggccc ttcaccaaca tgcccacaga tcaactggaa tggatggtac agctgtgtgg 5760 
tgcttctgtg gtgaaggagc tttcatcatt cacccttggc acaggtgtcc acccaattgt 5820 
ggttgtgcag ccagatgcct ggacagagga caatggcttc catgcaattg ggcagatgtg 5880 
tgaggcacct gtggtgaccc gagagtgggt gttggacagt gtagcactct accagtgcca 5940 
ggagctggac acctacctga taccccagat cccccacagc cactactgac tgcagccagc 6000 
cacaggtaca gagccacagg accccaagaa tgagcttaca aagtggcctt tccaggccct 6060 
gggagctcct ctcactcttc agtccttcta ctgtcctggc tactaaatat tttatgtaca 6120 
tcagcctgaa aaggacttct ggctatgcaa gggtccctta aagattttct gcttgaagtc 6180 
tcccttggaa atctgccatg agcacaaaat tatggtaatt tttcacctga gaagatttta 6240 
aaaccattta aacgccacca attgagcaag atgctgattc attatttatc agccctattc 6300 
tttctattca ggctgttgtt ggcttagggc tggaagcaca gagtggcttg gcctcaagag 6360 
aatagctggt ttccctaagt ttacttctct aaaaccctgt gttcacaaag gcagagagtc 6420 
agacccttca atggaaggag agtgcttggg atcgattatg tgacttaaag tcagaatagt 6480 
ccttgggcag ttctcaaatg ttggagtgga acattgggga ggaaattctg aggcaggtat 6540 
tagaaatgaa aaggaaactt gaaacctggg catggtggct cacgcctgta atcccagcac 6600 
tttgggaggc caaggtgggc agatcactgg aggtcaggag ttcgaaacca gcctggccaa 6660 
catggtgaaa ccccatctct actaaaaata cagaaattag ccggtcatgg tggtggacac 6720 
ctgtaatccc agctactcag gtggctaagg caggagaatc acttcagccc gggaggtgga 6780 
ggttgcagtg agccaagatc ataccacggc actccagcct gggtgacagt gagactgtgg 6840 
ctcaaaaaaa aaaaaaaaaa aggaaaatga aactaggaaa ggtttcttaa agtctgagat 6900 
atatttgcta gatttctaaa gaatgtgttc taaaacagca gaagattttc aagaaccggt 6960 
ttccaaagac agtcttctaa ttcctcatta gtaataagta aaatgtttat tgttgtagct 7020 
ctggtatata atccattcct cttaaaatat aagacctctg gcatgaatat ttcatatcta 7080 
taaaatgaca gatcccacca ggaaggaagc tgttgctttc tttgaggtga tttttttcct 7140 
ttgctccctg ttgctgaaac catacagctt cataaataat tttgcttgct gaaggaagaa 7200 
aaagtgtttt tcataaaccc attatccagg actgtttata gctgttggaa ggactaggtc 7260 
ttccctagcc cccccagtgt gcaagggcag tgaagacttg attgtacaaa atacgttttg 7320 
taaatgttgt gctgttaaca ctgcaaataa acttggtagc aaaca 7365 
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<212> DNA 

<213> Homo sapiens 
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<221> unsure 

<222> CO)... CO) 

<223> n = a, t, c or g 

<400> 342 

attgaggact cggaaatgag gtccaagggt agccaaggat ggctgcagct tcatatgatc 60 
agttgttaaa gcaagttgag gcactgaaga tggagaactc aaatcttcga caagagctag 120 
aagataattc caatcatctt acaaaactgg aaactgaggc atctaatatg aaggaagtac 180 
ttaaacaact acaaggaagt attgaagatg aagctatggc ttcttctgga cagattgatt 240 
tattagagcg tcttaaagag cttaacttag atagcagtaa tttccctgga gtaaaactgc 300 
ggtcaaaaat gtccctccgt tcttatggaa gccgggaagg atctgtatca agccgttctg 360 
gagagtgcag tcctgttcct atgggttcat ttccaagaag agggtttgta aatggaagca 420 
gagaaagtac tggatattta gaagaacttg agaaagagag gtcattgctt cttgctgatc 480 
ttgacaaaga agaaaaggaa aaagactggt attacgctca acttcagaat ctcactaaaa 540 
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gaatagatag tcttccttta actgaaaatt 
aattggaata tgaagcaagg caaatcagag 
aggatatgga aaaacgagca cagcgaagaa 
tacttcgtat acgacagctt ttacagtccc 
acaagcatga aaccggctca catgatgctg 
aaatcaacat ggcaacttct ggtaatggtc 
cagccagtgt tttgagttct agtagcacac 
tgggaaccaa ggtggaaatg gtgtattcat 
atgatatgtc gcgaactttg ctagctatgt 
gacagtctgg atgtcttcct ctcctcatcc 
tattgttggg aaattcccgg ggcagtaaag 
acaacatcat tcactcacag cctgatgaca 
atcttttgga acagatacgc gcttactgtg 
aaccaggcat ggaccaggac aaaaatccaa 
ctgctgtgtg tgttctaatg aaactttcat 
aactaggggg actacaggcc attgcagaat 
ttactaatga ccactacagt attacactaa 
tgacttttgg agatgtagcc aacaaggcta 
cacttgtggc ccaactaaaa tctgaaagtg 
tgaggaattt gtcttggcga gcagatgtaa 
gtgtgaaagc attgatggaa tgtgctttag 
tattgagtgc cttatggaat ttgtcagcac 
ctgtagatgg tgcacttgca tttttggttg 
ctttagccat tattgaaagt ggaggtggga 
caaatgagga ccacaggcaa atcctaagag 
acttaaaatc tcatagtttg acaatagtca 
cagcaagaaa tcctaaagac caggaagcat 
agaacctcat tcattcaaag cacaaaatga 
atctcatggc aaataggcct gcgaagtaca 
gcttgccatc tcttcatgtt aggaaacaaa 
acttatcaga aacttttgac aatatagaca 
agcagagaca caagcaaagt ctctatggtg 
ataataggtc agacaatttt aatactggca 
ctacagtgtt acccagctcc tcttcatcaa 
aagatagaag tttggagaga gaacgcggaa 
aaaatccagg aacttcttca aagcgaggtt 
ccaaagtcat ggaagaagtg tcagccattc 
ctaccactga attacattgt gtgacagatg 
cccatacaca ttcaaacact tacaatttca 
ctatgcctta tgccaaatta gaatacaaga 
gtagtagtga tggttatggt aaaagaggtc 
aagatgatga aagtaagttt tgcagttatg 
tacatagtgc aaatcatatg gatgataatg 
gtcttaaata ttcagatgag cagttgaact 
gatgggcaag acccaaacac ataatagaag 
caaggaatca aagtacaact tatcctgttt 
agttccaacc acattttgga cagcaggaat 
atggttcaga aacaaatcga gtgggttcta 
ctttgtgtca agaagatgac tatgaagatg 
ctgaagaaga acagcatgaa gaagaagaga 
aagagaaacg tcatgtggat cagcctattg 
cttcatcaca gaaacagtca ttttcattct 
ccgaacatat gtcttcaagc agtgagaata 
agaatcagct ccatccaagt tctgcacaga 
cttgcaaagt ttcttctatt aaccaagaaa 
caatatgttt ttcaagatgt agttcattat 
gatgtaatca gacgacacag gaagcagatt 
aagaaaagat tggaactagg tcagctgaag 
agcaccctag aaccaaatcc agcagactgc 
ggcacaaagc tgttgaattt tcttcaggag 
cacccaaaag tccacctgaa cactatgttc 
cttctgtcag ttcacttgat agttttgaga 
aaccatgcag tggaatggta agtggcatta 
gacaaaccat gccaccaagc agaagtaaaa 
ccaagcgaga agtacctaaa aataaagcac 
agcaagctgc agtaaatgct gcagttcaga 
tattacattn tgccacggaa agtactccag 
ctctgagcct cgatgagcca tittatacaga 
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tttccttaca aacagatatg accagaaggc 600 
ttgcgatgga agaacaacta ggtacctgcc 660 
tagccagaat tcagcaaatc gaaaaggaca 720 
aagcaacaga agcagagagg tcatctcaga 780 
agcggcagaa tgaaggtcaa ggagtgggag 840 
agggttcaac tacacgaatg gaccatgaaa 900 
actctgcacc tcgaaggctg acaagtcatc 960 
tgttgtcaat gcttggtact catgataagg 1020 
ctagctccca agacagctgt atatccatgc 1080 
agcttttaca tggcaatgac aaagactctg 1140 
aggctcgggc cagggccagt gcagcactcc 1200 
agagaggcag gcgtgaaatc cgagtccttc 1260 
aaacctgttg ggagtggcag gaagctcatg 1320 
tgccagctcc tgttgaacat cagatctgtc 1380 
ttgatgaaga gcatagacat gcaatgaatg 1440 
tattgcaagt ggactgtgaa atgtacgggc 1500 
gacgatatgc tggaatggct ttgacaaact 1560 
cgctatgctc tatgaaaggc tgcatgagag 1620 
aagacttaca gcaggttatt gcaagtgttt 1680 
atagtaaaaa gacgttgcga gaagttggaa 1740 
aagttaaaaa ggaatcaacc ctcaaaagcg 1800 
attgcactga gaataaagct gatatatgtg 1860 
gcactcttac ttaccggagc cagacaaaca 1920 
tattacggaa tgtgtccagc ttgatagcta 1980 
agaacaactg tctacaaact ttattacaac 2040 
gtaatgcatg tggaactttg tggaatctct 2100 
tatgggacat gggggcagtt agcatgctca 2160 
ttgctatggg aagtgctgca gctttaagga 2220 
aggatgccaa tattatgtct cctggctcaa 2280 
aagccctaga agcagaatta gatgctcagc 2340 
atttaagtcc caaggcatct catcgtagta 2400 
attatgtttt tgacaccaat cgacatgatg 2460 
acatgactgt cctttcacca tatttgaata 2520 
gaggaagctt agatagttct cgttctgaaa 2580 
ttggtctagg caactaccat ccagcaacag 2640 
tgcagatctc caccactgca gcccagattg 2700 
atacctctca ggaagacaga agttctgggt 2760 
agagaaatgc acttagaaga agctctgctg 2820 
ctaagtcgga aaattcaaat aggacatgtt 2880 
gatcttcaaa tgatagttta aatagtgtca 2940 
aaatgaaacc ctcgattgaa tcctattctg 3000 
gtcaataccc agccgaccta gcccataaaa 3060 
atggagaact agatacacca ataaattata 3120 
ctggaaggca aagtccttca cagaatgaaa 3180 
atgaaataaa acaaagtgag caaagacaat 3240 
atactgagag cactgatgat aaacacctca 3300 
gtgtttctcc atacaggtca cggggagcca 3360 
atcatggaat taatcaaaat gtaagccagt 3420 
ataagcctac caattatagt gaacgttact 3480 
gaccaacaaa ttatagcata aaatataatg 3540 
attatagttt aaaatatgcc acagatattc 3600 
caaagagttc atctggacaa agcagtaaaa 3660 
cgtccacacc ttcatctaat gccaagaggc 3720 
gtagaagtgg tcagcctcaa aaggctgcca 3780 
caatacagac ttattgtgta gaagatactc 3840 
catctttgtc atcagctgaa gatgaaatag 3900 
ctgctaatac cctgcaaata gcagaaataa 3960 
atcctgtgag cgaagttcca gcagtgtcac 4020 
agggttctag tttatcttca gaatcagcca 4080 
cgaaatctcc ctccaaaagt ggtgctcaga 4140 
aggagacccc actcatgttt agcagatgta 4200 
gtcgttcgat tgccagctcc gttcagagtg 4260 
taagccccag tgatcttcca gatagccctg 4320 
cacctccacc acctcctcaa acagctcaaa 4380 
ctactgctga aaagagagag agtggaccta 4440 
gggtccaggt tcttccagat gctgatactt 4500 
atggattttc ttgttcatcc agcctgagtg 4560 
aagatgtgga attaagaata atgcctccag 4620 
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ttcaggaaaa tgacaatggg aatgaaacag 
accaagagaa agaggcagaa aaaactattg 
atgatgatga tattgaaata ctagaagaat 
cacgtaaagc aaaaaagcca gcccagactg 
aaccaagtca gctgcctgtg tacaaacttc 
agcatgttag ttttacaccg ggggatgata 
ctataaactt ttccacagct acatctctaa 
agttagctgc tggagaagga gttagaggag 
ataccattcc tacagaaggc agaagtacag 
taaccatacc tgaattggat gacaataaag 
ttaattctgc tatgcccaaa gggaaaagtc 
accaggtcca gcaagcatct gcgtcgtctt 
agaaaaagaa accaacttca ccagtaaaac 
gtgtaagaaa aaatgcagac tcaaaaaata 
acaaagattc aaagaaacag aatttgaaaa 
caaataatga agatagagtc agaggaagtt 
ctattgaagg aactccttac tgtttttcac 
atgatgatga tgttgacctt tccagggaaa 
aggaatcaga ggctaaagtt accagccaca 
ataagacaca agctattgca aagcagccaa 
agaaacaatc cacttttccc cagtcatcca 
atgaaaagtt acagaatttt gctattgaaa 
ctctgagttc tctcagtgac attgaccaag 
aagagactga gccccctgac tcacagggag 
ctcctaaatc atttcatgtt gaagataccc 
gttctcttag tattgactct gaagatgacc 
caaaaaagaa aaagccttca agactcaagg 
tgggtggcat attaggtgaa gatctgacac 
cagaacatgg tctatcccct gattcagaaa 
caaattccat agtaagtagt ttacatcaag 
cttcgtctga ttcagattcc atcctttccc 
ttcatcttac acctgatcaa gaagaaaaac 
taaaaccagg ggagaaaagt acattggaaa 
tcaaaggagg aaaaaaagtt tataaaagtt 
aaatttcagg ccaaatgaaa cagccccttc 
ggacaatgat tcatattcca ggagttcgaa 
aaaaaggccc accccttaag actccagcct 
ccacttctcc tagaggagcc aagccatctg 
agacatccca aataggtggg tcaagtaaag 
ccccttcaag acctgcccag caaccattaa 
caatttcccc tggtagaaat ggaataagtc 
catcatcccc tagtactgct tcaactaagt 
ctccaggtag acagatgagc caacagaacc 
ccagtagtat tccaagaagt gagtctgcct 
atggagccaa taaaaaggta gaactttcta 
aatctgatag atcagaaaga cctgtattag 
caagcccaac cttaagaaga aaattggagg 
catctagacc agcttctccc actaggtccc 
ttcctgatat gtctctatcc acacattcgt 
cacctaatct cagtcccact atagagtata 
ttgcacggtc tcattctgaa agtccttcta 
aacgtgagca cagcaaacat tcatcatccc 
gaagttcatc ttcaattctt tctgcttcat 
atgaaaaaca tgtgaactct atttcaggaa 
caaaaggaac atggagaaaa ataaaagaaa 
agaccgtttc ctcaggtgct acaaatggtg 
cacctgctgt ttctaaaaca gaggatgttt 
atcctagatc tggaagatct cccacaggta 
aaaaggcaaa tccaaacatt aaagattcaa 
atggcagtgt tcccatgcgt accgtgggtt 
tggatgcccc tgaccaaaaa ggaactgaga 
tatcagagac taatgaaagt tctatagtgg 
gcaaacacag ttcacctagt gggactgttg 
caagccctag gaaaagcagc gcagatagca 
cagtgaataa caacacaaag aagcgagatt 
cccaaagtcc taagcgccat tctgggtctt 
gaatgaaact aagaaaattc tatgttaatt 
tgaaacttta aaagactgaa aaattttgta 
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aatcagagca gcctaaagaa tcaaatgaaa 4680 
attctgaaaa ggacctatta gatgattcag 4740 
gtattatttc tgccatgcca acaaagtcat 4800 
cttcaaaatt acctccacct gtggcaagga 4860 
taccatcaca aaacaggttg caaccccaaa 4920 
tgccacgggt gtattgtgtt gaagggacac 4980 
gtgatctaac aatcgaatcc cctccaaatg 5040 
gagcacagtc aggtgaattt gaaaaacgag 5100 
atgaggctca aggaggaaaa acctcatctg 5160 
cagaggaagg tgatattctt gcagaatgca 5220 
acaagccttt ccgtgtgaaa aagataatgg 5280 
ctgcacccaa caaaaatcag ttagatggta 5340 
ctataccaca aaatactgaa tataggacac 5400 
atttaaatgc tgagagagtt ttctcagaca 5460 
ataattccaa ggacttcaat gataagctcc 5520 
ttgcttttga ttcacctcat cattacacgc 5580 
gaaatgattc tttgagttct ctagattttg 5640 
aggctgaatt aagaaaggca aaagaaaata 5700 
cagaactaac ctccaaccaa caatcagcta 5760 
taaatcgagg tcagcctaaa cccatacttc 5820 
aagacatacc agacagaggg gcagcaactg 5880 
atactccagt ttgcttttct cataattcct 5940 
aaaacaacaa taaaigaaaat gaacctatca 6000 
aaccaagtaa acctcaagca tcaggctatg 6060 
cagtttgttt ctcaagaaac agttctctca 6120 
tgttgcagga atgtataagc tccgcaatgc 6180 
gtgataatga aaaacatagt cccagaaata 6240 
ttgatttgaa agatatacag agaccagatt 6300 
attttgattg gaaagctatt caggaaggtg 6360 
ctgctgctgc tgcatgttta tctagacaag 6420 
tgaaatcagg aatctctctg ggatcaccat 6480 
cctttacaag taataaaggc ccacgaattc 6540 
ctaaaaagat agaatctgaa agtaaaggaa 6600 
tgattactgg aaaagttcga tctaattcag 6660 
aagcaaacat gccttcaatc tctcgaggca 6720 
atagctcctc aagtacaagt cctgtttcta 6780 
ccaaaagccc tagtgaaggt caaacagcca 6840 
tgaaatcaga attaagccct gttgccaggc 6900 
caccttctag atcaggatct agagattcga 6960 
gtagacctat acagtctcct ggccgaaact 7020 
ctcctaacaa attatctcaa cttccaagga 7080 
cctcaggttc tggaaaaatg tcatatacat 7140 
ttaccaaaca aacaggttta tccaagaatg 7200 
ccaaaggact aaatcagatg aataatggta 7260 
gaatgtcttc aactaaatca agtggaagtg 7320 
tacgccagtc aactttcatc aaagaagctc 7380 
aatctgcttc atttgaatct ctttctccat 7440 
aggcacaaac tccagtttta agtccttccc 7500 
ctgttcaggc tggtggatgg cgaaaactcc 7560 
atgatggaag accagcaaag cgccatgata 7620 
gacttccaat caataggtca ggaacctgga 7680 
ttcctcgagt aagcacttgg agaagaactg 7740 
cagaatccag tgaaaaagca aaaagtgagg 7800 
ccaaacaaag taaagaaaac caagtatccg 7860 
atgaattttc tcccacaaat agtacttctc 7920 
ctgaatcaaa gactctaatt tatcaaatgg 7980 
gggtgagaat tgaggactgt cccattaaca 8040 
atactccccc ggtgattgac agtgtttcag 8100 
aagataatca ggcaaaacaa aatgtgggta 8160 
tggaaaatcg cctgaactcc tttattcagg 8220 
taaaaccagg acaaaataat cctgtccctg 8280 
aacgtacccc attcagttct agcagctcaa 8340 
ctgccagagt gactcctttt aattacaacc 8400 
cttcagctcg gccatctcag atcccaactc 8460 
ccaaaactga cagcacagaa tccagtggaa 8520 
accttgtgac atctgtttaa aagagaggaa 8580 
acaactgcta tatagacatt ttgtttcaaa 8640 
aataggtttg attcttgtta gagggttttt 8700 
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gttctggaag ccatatttga tagtatactt tgtcttcact ggtcttattt tgggaggcac 8760 
tcttgatggt taggaaaaaa atagtaaagc caagtatgtt tgtacagtat gttttacatg 8820 
tatttaaagt agcatcccat cccaacttcc tttaattatt gcttgtctta aaataatgaa 8880 
cactacagat agaaaatatg atatattgct gttatcaatc atttctagat tataaactga 8940 
ctaaacttac atcagggaaa aattggtatt tatgcaaaaa aaaatgtttt tgtccttgtg 9000 
agtccatcta acatcataat taatcatgtg gctgtgaaat tcacagtaat atggttcccg 9060 
atgaacaagc tttacccagc ctgtttgctt tactgcatga atgaaactga tggttcaatt 9120 
tcagaagtaa tgattaacag ttatgtggtc acatgatgtg catagagata gctacagtgt 9180 
aataatttac actattttgt gctccaaaca aaacaaaaat ctgtgtaact gtaaaacatt 9240 
gaatgaaact attttacctg aactagattt tatctgaaag taggtagaat ttttgctatg 9300 
ctgtaatttg ttgtatattc tggtatttga ggtgagatgg ctgctctttt attaatgaga 9360 
catgaattgt gtctcaacag aaactaaatg aacatttcag aataaattat tgctgtatgt 9420 
aaactgttac tgaaattggt atttgtttga agggtcttgt ttcacatttg tattaataat 9480 
tgtttaaaat gcctctttta aaagcttata taaatttttt ncttcagctt ctatgcatta 9540 
agagtaaaat tcctcttact gtaataaaaa caattgaaga agactgttgc cacttaacca 9600 
ttccatgcgt tggcacttat ctattcctga aattctttta tgtgattagc tcatcttgat 9660 
ttttaacatt tttccactta aacttttttt tcttactcca ctggagctca gtaaaagtaa 9720 
attcatgtaa tagcaatgca agcagcctag cacagactaa gcattgagca taataggccc 9780 
acataatttc ctctttctta atattataga aattctgtac ttgaaattga ttcttagaca 9840 
ttgcagtctc ttcgaggctt tacagtgtaa actgtcttgc cccttcatct tcttgttgca 9900 
actgggtctg acatgaacac tttttatcac cctgtatgtt agggcaagat ctcagcagtg 9960 
aagtataatc agcactttgc catgctcaga aaattcaaat cacatggaac tttagaggta 10020 
gatttaatac gattaagata ttcagaagta tattttagaa tccctgcctg ttaaggaaac 10080 
tttatttgtg gtaggtacag ttctggggta catgttaagt gtccccttat acagtggagg 10140 
gaagtcttcc ttcctgaagg aaaataaact gacacttatt aactaagata atttacttaa 10200 
tatatcttcc ctgatttgtt ttaaaagatc agagggtgac tgatgataca tgcatacata 10260 
tttgttgaat aaatgaaaat ttatttttag tgataagatt catacactct gtatttgggg 10320 
agagaaaacc tttttaagca tggtggggca ctcagatagg agtgaataca cctacctggt 10380 
ggtcat 10386 



<210> 343 

<211> 2191 

<212> DNA 

<213> Homo sapiens 

<400> 343 

ggtggccgag cgggggaccg ggaagcatgg cccgggggtc ggcggttgcc tgggcggcgc 60 
tcgggccgtt gttgtggggc tgcgcgctgg ggctgcaggg cgggatgctg tacccccagg 120 
agagcccgtc gcgggagtgc aaggagctgg acggcctctg gagcttccgc gccgacttct 180 . 
ctgacaaccg acgccggggc ttcgaggagc agtggtaccg gcggccgctg tgggagtcag 240 
gccccaccgt ggacatgcca gttccctcca gcttcaatga catcagccag gactggcgtc 300 
tgcggcattt tgtcggctgg gtgtggtacg aacgggaggt gatcctgccg gagcgatgga 360 
cccaggacct gcgcacaaga gtggtgctga ggattggcag tgcccattcc tatgccatcg 420 
tgtgggtgaa tggggtcgac acgctagagc atgagggggg ctacctcccc ttcgaggccg 480 
acatcagcaa cctggtccag gtggggcccc tgccctcccg gctccgaatc actatcgcca 540 
tcaacaacac actcaccccc accaccctgc caccagggac catccaatac ctgactgaca 600 
cctccaagta tcccaagggt tactttgtcc agaacacata ttttgacttt ttcaactacg 660 
ctggactgca gcggtctgta cttctgtaca cgacacccac cacctacatc gatgacatca 720 
ccgtcaccac cagcgtggag caagacagtg ggctggtgaa ttaccagatc tctgtcaagg 780 
gcagtaacct gttcaagttg gaagtgcgtc ttttggatgc agaaaacaaa gtcgtggcga 840 
atgggactgg gacccagggc caacttaagg tgccaggtgt cagcctctgg tggccgtacc 900 
tgatgcacga acgccctgcc tatctgtatt cattggaggt gcagctgact gcacagacgt 960 
cactggggcc tgtgtctgac ttctacacac tccctgtggg gatccgcact gtggctgtca 1020 
ccaagagcca gttcctcatc aatgggaaac ctttctattt ccacggtgtc aacaagcatg 1080 
aggatgcgga catccgaggg aagggcttcg actggccgct gctggtgaag gacttcaacc 1140 
tgcttcgctg gcttggtgcc aacgctttcc gtaccagcca ctacccctat gcagaggaag 1200 
tgatgcagat gtgtgaccgc tatgggattg tggtcatcga tgagtgtccc ggcgtgggcc 1260 
tggcgctgcc gcagttcttc aacaacgttt ctctgcatca ccacatgcag gtgatggaag 1320 
aagtggtgcg tagggacaag aaccaccccg cggtcgtgat gtggtctgtg gccaacgagc 1380 
ctgcgtccca cctagaatct gctggctact acttgaagat ggtgatcgct cacaccaaat 1440 
ccttggaccc ctcccggcct gtgacctttg tgagcaactc taactatgca gcagacaagg 1500 
gggctccgta tgtggatgtg atctgtttga acagctacta ctcttggtat cacgactacg 1560 
ggcacctgga gttgattcag ctgcagctgg ccacccagtt tgagaactgg tataagaagt 1620 
atcagaagcc cattattcag agcgagtatg gagcagaaac gattgcaggg tttcaccagg 1680 
atccacctct gatgttcact gaagagtacc agaaaagtct gctagagcag taccatctgg 1740 
gtctggatca aaaacgcaga aaatatgtgg ttggagagct catttggaat tttgccgatt 1800 
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tcatgactga acagtcaccg acgagagtgc tggggaataa aaaggggatc ttcactcggc 1860 
agagacaacc aaaaagtgca gcgttccttt tgcgagagag atactggaag attgccaatg 1920 
aaaccaggta tccccactca gtagccaagt cacaatgttt ggaaaacagc ccgtttactt 1980 
gagcaagact gataccacct gcgtgtccct tcctccccga gtcagggcga cttccacagc 2040 
agcagaacaa gtgcctcctg gactgttcac ggcagaccag aacgtttctg gcctgggttt 2100 
tgtggtcatc tattctagca gggaacacta aaggtggaaa taaaagattt tctattatgg 2160 

aaataaagag ttggcatgaa agtcgctact g 2191 

<210> 344 

<211> 2776 

<212> DNA 

<213> Homo sapiens 

<400> 344 

cagggcagac tggtagcaaa gcccccacgc ccagccagga gcaccgccgc ggactccagc 60 

acaccgaggg acatgctggg cctgcgcccc ccactgctcg ccctggtggg gctgctctcc 120 

ctcgggtgcg tcctctctca ggagtgcacg aagttcaagg tcagcagctg ccgggaatgc 180 

atcgagtcgg ggcccggctg cacctggtgc cagaagctga acttcacagg gccgggggat 240 

cctgactcca ttcgctgcga cacccggcca cagctgctca tgaggggctg tgcggctgac 300 

gacatcatgg accccacaag cctcgctgaa acccaggaag accacaatgg gggccagaag 360 

cagctgtccc cacaaaaagt gacgctttac ctgcgaccag gccaggcagc agcgttcaac 420 

gtgaccttcc ggcgggccaa gggctacccc atcgacctgt actatctgat ggacctctcc 480 

tactccatgc ttgatgacct caggaatgtc aagaagctag gtggcgacct gctccgggcc 540 

ctcaacgaga tcaccgagtc cggccgcatt ggcttcgggt ccttcgtgga caagaccgtg 600 

ctgccgttcg tgaacacgca ccctgataag ctgcgaaacc catgccccaa caaggagaaa 660 

gagtgccagc ccccgtttgc cttcaggcac gtgctgaagc tgaccaacaa ctccaaccag 720 

tttcagaccg aggtcgggaa gcagctgatt tccggaaacc tggatgcacc cgagggtggg 780 

ctggacgcca tgatgcaggt cgccgcctgc ccggaggaaa tcggctggcg caacgtcacg 840 

cggctgctgg tgtttgccac tgatgacggc ttccatttcg cgggcgacgg aaagctgggc 900 

gccatcctga cccccaacga cggccgctgt cacctggagg acaacttgta caagaggagc 960 

aacgaattcg actacccatc ggtgggccag ctggcgcaca agctggctga aaacaacatc 1020 

cagcccatct tcgcggtgac cagtaggatg gtgaagacct acgagaaact caccgagatc 1080 

atccccaagt cagccgtggg ggagctgtct gaggactcca gcaatgtggt ccatctcatt 1140 

aagaatgctt acaataaact ctcctccagg gtcttcctgg atcacaacgc cctccccgac 1200 

accctgaaag tcacctacga ctccttctgc agcaatggag tgacgcacag gaaccagccc 1260 

agaggtgact gtgatggcgt gcagatcaat gtcccgatca ccttccaggt gaaggtcacg 1320 

gccacagagt gcatccagga gcagtcgttt gtcatccggg cgctgggctt cacggacata 1380 

gtgaccgtgc aggttcttcc ccagtgtgag tgccggtgcc gggaccagag cagagaccgc 1440 

agcctctgcc atggcaaggg cttcttggag tgcggcatct gcaggtgtga cactggctac 1500 

attgggaaaa actgtgagtg ccagacacag ggccggagca gccaggagct ggaaggaagc 1560 

tgccggaagg acaacaactc catcatctgc tcagggctgg gggactgtgt ctgcgggcag 1620 

tgcctgtgcc acaccagcga cgtccccggc aagctgatat acgggcagta ctgcgagtgt 1680 

gacaccatca actgtgagcg ctacaacggc caggtctgcg gcggcccggg gagggggctc 1740 

tgcttctgcg ggaagtgccg ctgccacccg ggctttgagg gctcagcgtg ccagtgcgag 1800 

aggaccactg agggctgcct gaacccgcgg cgtgttgagt gtagtggtcg tggccggtgc 1860 

cgctgcaacg tatgcgagtg ccattcaggc taccagctgc ctctgtgcca ggagtgcccc 1920 

ggctgcccct caccctgtgg caagtacatc tcctgcgccg agtgcctgaa gttcgaaaag 1980 

ggcccctttg ggaagaactg cagcgcggcg tgtccgggcc tgcagctgtc gaacaacccc 2040 

gtgaagggca ggacctgcaa ggagagggac tcagagggct gctgggtggc ctacacgctg 2100 

gagcagcagg acgggatgga ccgctacctc atctatgtgg atgagagccg agagtgtgtg 2160 

gcaggcccca acatcgccgc catcgtcggg ggcaccgtgg caggcatcgt gctgatcggc 2220 

attctcctgc tggtcatctg gaaggctctg atccacctga. gcgacctccg ggagtacagg 2280 

cgctttgaga aggagaagct caagtcccag tggaacaatg ataatcccct tttcaagagc 2340 

gccaccacga cggtcatgaa ccccaagttt gctgagagtt aggagcactt ggtgaagaca 2400 

aggccgtcag gacccaccat gtctgcccca tcacgcggcc gagacatggc ttggccacag 2460 

ctcttgagga tgtcaccaat taaccagaaa tccagttatt ttccgccctc aaaatgacag 2520 

ccatggccgg ccggtgcttc tgggggctcg tcggggggac agctccactc tgactggcac 2580 

agtctttgca tggagacttg aggagggctt gaggttggtg aggttaggtg cgtgtttcct 2640 

gtgcaagtca ggacatcagt ctgattaaag gtggtgccaa tttatttaca tttaaacttg 2700 

tcagggtata aaatgacatc ccattaatta tattgttaat caatcacgtg tatagaaaaa 2760 

aaaataaaac ttcaat 2776 

<210> 345 

<211> 3160 

<212> DNA 

<213> Homo sapiens 

<400> 345 
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cctcccctcg 
ttccgaggcg 
gatgtggcag 
gcgctcagtt 
tgtgaggcga 
gcccctctca 
gcggcggcgg 
cttcctcggc 
aggcgcggcg 
cggcggcggc 
ccagggctgg 
ggcggcggcg 
caccccccgt 
cgtcttctcc 
cccagtcgct 
gccaagcggc 
tgcagaagaa 
acaggctccc 
atcaagagga 
gatttcctgc 
ttttggattc 
atgacaccgc 
cacagctaga 
acaatcatgt 
gtgcatattt 
gggaagtaag 
attattatag 
acaagatgat 
tggtctgcca 
acaagttcat 
agttcttcca 
atacattctt 
gtgatcaaga 
tagtacttac 
acttttctcc 
atccagaggc 
attatagata 
agcatacaca 
aaataaactt 
gtcagattac 
catccacagg 
tatatacctt 
ctttcccgtt 
attttttcct 
gttcacatcc 
gtttccgaaa 
aatgctcaga 
ttacacacac 
gctgtcactg 
gctaaaggaa 
ccacaaatga 
accatataaa 
atgatttatt 



cccggcgcgg 
cccgggctcc 
gactctttat 
ctctcctctc 
ggccgggctc 
gcgcctgtga 
cagcggcggc 
ttctcctgaa 
gcggcggcgg 
cgcggcggct 
gaacgccgga 
gcacatccag 
ggcccgggct 
ccattccgct 
gcaaccatcc 
ggcagagcga 
gccccgccac 
agacatgaca 
tggattcgac 
agaaagactt 
aaagcataaa 
caaatttaat 
acttatcaaa 
tgcagcaatt 
attacatcgg 
gaccagagac 
ctacctgtta 
gtttgaaact 
gctaaaggtg 
gtactttgag 
caaacagaac 
cataccagga 
aatcgatagc 
tttaacaaaa 
aaattttaag 
tagcagttca 
ttctgacacc 
aattacaaaa 
gaataaactg 
cagttatagg 
gttttgacac 
tttgtgtcaa 
ttattccagt 
tttgtgttct 
tacccctttg 
ggttttgcta 
aaggaaataa 
ctttctttag 
cttgttgttt 
gtgaatctgt 
agggatataa 
tgtggaggct 
aaatatgttt 



39740-0001PCT.txt 
tcccgtccgc ctctcgctcg cctcccgcct cccctcggtc 60 
cggcgcggcg gcggaggggg cgggcaggcc ggcgggcggt 120 
gcgctgcggc aggatacgcg ctcggcgctg ggacgcgact 180 
ggaagctgca gccatgatgg aagtttgaga gttgagccgc 240 
aggcgaggga gatgagagac ggcggcggcc gcggccegga 300 
gcagccgcgg gggcagcgcc ctcggggagc cggccggcct 360 
gtttctcgcc tcctcttcgt cttttctaac cgtgcagcct 420 
agggaaggtg gaagccgtgg gctcgggcgg gagccggctg 480 
cggcacctcc cgctcctgga gcggggggga gaagcggcgg 540 
gcagctccag ggagggggtc tgagtcgcct gtcaccattt 600 
gagttggtct ctccccttct actgcctcca acacggcggc 660 
ggacccgggc cggttttaaa cctcccgtcc gccgccgccg 720 
ccggaggccg ccggcggagg cagccgttcg gaggattatt 780 
gccgccgctg ccaggcctct ggctgctgag gagaagcagg 840 
agcagccgcc gcagcagcca ttacccggct gcggtccaga 900 
ggggcatcag ctaccgccaa gtccagagcc atttccatcc 960 
cagcagcttc tgccatctct ctcctccttt ttcttcagcc 1020 
gccatcatca aagagatcgt tagcagaaac aaaaggagat 1080 
ttagacttga cctatattta tccaaacatt attgctatgg 1140 
gaaggcgtat acaggaacaa tattgatgat gtagtaaggt 1200 
aaccattaca agatatacaa tctttgtgct gaaagacatt 1260 
tgcagagttg cacaatatcc ttttgaagac cataacccac 1320 
cccttttgtg aagatcttga ccaatggcta agtgaagatg 1380 
cactgtaaag ctggaaaggg acgaactggt gtaatgatat 1440 
ggcaaatttt taaaggcaca agaggcccta gatttctatg 1500 
aaaaagggag taactattcc cagtcagagg cgctatgtgt 1560 
aagaatcatc tggattatag accagtggca ctgttgtttc 1620 
attccaatgt tcagtggcgg aacttgcaat cctcagtttg 1680 
aagatatatt cctccaattc aggacccaca cgacgggaag 1740 
ttccctcagc cgttacctgt gtgtggtgat atcaaagtag 1800 
aagatgctaa aaaaggacaa aatgtttcac ttttgggtaa 1860 
ccagaggaaa cctcagaaaa agtagaaaat ggaagtctat 1920 
atttgcagta tagagcgtgc agataatgac aaggaatatc 1980 
aatgatcttg acaaagcaaa taaagacaaa gccaaccgat 2040 
gtgaagctgt acttcacaaa aacagtagag gagccgtcaa 2100 
acttctgtaa caccagatgt tagtgacaat gaacctgatc 2160 
actgactctg atccagagaa tgaacctttt gatgaagatc 2220 
gtctgaattt ttttttatca agagggataa aacaccatga 2280 
aaaatggacc tttttttttt taatggcaat aggacattgt 2340 
aacaattctc ttttcctgac caatcttgtt ttaccctata 2400 
ttgttgtcca gttgaaaaaa ggttgtgtag ctgtgtcatg 2460 
aaggacattt aaaattcaat taggattaat aaagatggca 2520 
tttataaaaa gtggagacag actgatgtgt atacgtagga 2580 
gtcaccaact gaagtggcta aagagctttg tgatatactg 2640 
cacttgtggc aacagataag tttgcagttg gctaagagag 2700 
ccattctaat gcatgtattc gggttagggc aatggagggg 2760 
ttttatgctg gactctggac catataccat ctccagctat 2820 
catgctacag utattaatct ggacattcga ggaattggcc 2880 
gcgcattttt ttttaaagca tattggtgct agaaaaggca 2940 
attggggtac aggaatgaac cttctgcaac atcttaagat 3000 
aaataatgtc ataggtaaga aacacagcaa caatgactta 3060 
atcaacaaag aatgggcttg aaacattata aaaattgaca 3120 
tctcaattgt aaaaaaaaaa 3160 



<210> 346 

<211> 2629 

<212> DNA 

<213> Homo sapiens 



<400> 346 

acttgtcatg 

ggttttcccc 

agagccaccg 

gggagcgtgc 

gggtcactgc 

aaacattttc 

cccaagcaat 

acccaggtcc 



gcgactgtcc 
tcccatgtgc 
tccagggagc 
tttccacgac 
catggaggag 
agacctatgg 
ggatgatttg 
agatgaagct 



agctttgtgc 
tcaagactgg 
aggtagctgc 
ggtgacacgc 
ccgcagtcag 
aaactacttc 
atgctgtccc 
cccagaatgc 



caggagcctc gcaggggttg 
cgctaaaagt tttgagcttc 
tgggctccgg ggacactttg 
ttccctggat tggcagccag 
atcctagcgt cgagccccct 
ctgaaaacaa cgttctgtcc 
cggacgatat tgaacaatgg 
cagaggctgc tccccgcgtg 
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atgggattgg 60 
tcaaaagtct 120 
cgttcgggct 180 
actgccttcc 240 
ctgagtcagg 300 
cccttgccgt 360 
ttcactgaag 420 
gcccctgcac 480 
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cagcagctcc 
tcccttccca 
ggacagccaa 
tggccaagac 
tccgcgccat 
cccaccatga 
tggaaggaaa 
tggtgcccta 
tgtgtaacag 
tggaagactc 
gtcctgggag 
acgagctgcc 
agccaaagaa 
gcttcgagat 
aggagccagg 
cctcccgcca 
cacttcttgt 
gttttgggtc 
atttgctttg 
gaggaggatg 
tttgggagat 
aggtaggtag 
tctctaactt 
cattgtgagg 
ggtctaaaac 
gggttggtag 
ggcccagcca 
caccccatcc 
gacttgtttt 
tttgagactg 
ctgcagcctt 
cacaggttca 
cagtgttgcc 
ccagagtgct 
tacattctgc 
atcccatttt 



tacaccggcg 
gaaaacctac 
gtctgtgact 
ctgccctgtg 
ggccatctac 
gcgctgctca 
tttgcgtgtg 
tgagccgcct 
ttcctgcatg 
cagtggtaat 
agaccggcgc 
cccagggagc 
gaaaccactg 
gttccgagag 
ggggagcagg 
taaaaaactc 
tccccactga 
tttgaaccct 
tcccggggct 
gggagtagga 
gtaagaaatg 
gggcccactt 
caaggcccat 
gttaatgaaa 
ttgaccccct 
tttctacagt 
aaccctgtct 
cacaccctgg 
atgctcaggg 
ggtctcgctt 
tgcctccccg 
tgccaccatg 
caggctggtc 
gggattacaa 
aagcacatct 
tatatcgatc 



39740~000lPCT.txt 
gcccctgcac cagccccctc ctggcccctg tcatcttctg 
cagggcagct acggtttccg tctgggcttc ttgcattctg 
tgcacgtact cccctgccct caacaagatg ttttgccaac 
cagctgtggg ttgattccac acccccgccc ggcacccgcg 
aagcagtcac agcacatgac ggaggttgtg aggcgctgcc 
gatagcgatg gtctggcccc tcctcagcat cttatccgag 
gagtatttgg atgacagaaa cacttttcga catagtgtgg 
gaggttggct ctgactgtac caccatccac tacaactaca 
ggcggcatga accggaggcc catcctcacc atcatcacac 
ctactgggac ggaacagctt tgaggtgcgt gtttgtgcct 
acagaggaag agaatctccg caagaaaggg gagcctcacc 
actaagcgag cactgcccaa caacaccagc tcctctcccc 
gatggagaat atttcaccct tcagatccgt gggcgtgagc 
ctgaatgagg ccttggaact caaggatgcc caggctggga 
gctcactcca gccacctgaa gtccaaaaag ggtcagtcta 
atgttcaaga cagaagggcc tgactcagac tgacattctc 
cagcctccca cccccatctc tccctcccct gccattttgg 
tgcttgcaat aggtgtgcgt cagaagcacc caggacttcc 
ccactgaaca agttggcctg cactggtgtt ttgttgtggg 
cataccagct tagattttaa ggtttttact gtgagggatg 
ttcttgcagt taagggttag tttacaatca gccacattct 
caccgtacta accagggaag ctgtccctca tgttgaattt 
atctgtgaaa tgctggcatt tgcacctacc tcacagagtg 
taatgtacat ctggccttga aaccaccttt tattacatgg 
tgagggtgcc tgttccctct ccctctccct gttggctggt 
tgggcagctg gttaggtaga gggagttgtc aagtcttgct 
gacaacctct tggtcgacct tagtacctaa aaggaaatct 
aggatttcat ctcttgtata tgatgatctg gatccaccaa 
tcaatttctt ttttcttttt tttttttttt tttctttttc 
tgttgcccag gctggagtgg agtggcgtga tcttggctta 
gctcgagcag tcctgcctca gcctccggag tagctgggac 
gccagccaac ttttgcatgt tttgtagaga tggggtctca 
tcaaactcct gggctcaggc gatccacctg tctcagcctc 
ttgtgagcca ccacgtggag ctggaagggt caacatcttt 
gcattttcac cccacccttc ccctccttct ccctttttat 
tcttatttta caataaaact ttgctgcca 



<210> 347 

<211> 3442 

<212> DNA 

<213> Homo sapiens 



<400> 347 

agccggtgcg 

cgctaacggg 

catccgcaac 

caagagtggg 

ggaagaagga 

gaggggctcg 

ggctgacctg 

gaagccattt 

ctttgcaggg 

atgcttcacc 

cctgctgatg 

gaagcctgcg 

cgggttccct 

aatttcttct 
actggtuaaa 
gaagaacccc 
tcagggagtg 
ggagcaggtc 
gggagacccc 
caaaatctta 
ctcagccatg 
caacatgcgg 
aagtatcgaa 
gttcacaaaa 
ctggatcaac 



ccgcagacta 
gccgtggaaa 
ctggaggtca 
aaaaagtttg 
gataagcccg 
ccatggcgcc 
gtggagaggg 
cttcatgctt 
tgggcagaca 
aggcatgagc 
ctggtgtgga 
gagcagacac 
ccaggagtgg 
caccctcaga 
gaagctgcgt 
tgcatcgtgt 
ttcttcaacc 
tactctgagt 
ttcgatgtca 
gagctgatcg 
gaagacaagg 
attgccaaag 
gaagtgataa 
aatctcgaca 
tgctacaacg 



gggcgcctcg ggccagggag cgcggaggag 
acgggcagcc ggacgggaag ccgccggccc 
agttcaccaa gatatttatc aacaatgaat 
ctacatgtaa cccttcaact cgggagcaaa 
acgtggacaa ggctgtggag gctgcacagg 
ggctggatgc cctgagtcgt gggcggctgc 
accgcgccac cttggccgcc ctggagacga 
ttttcatcga cctggagggc tgtattagaa 
aaatccaggg caagaccatc cccacagatg 
ccattggtgt ctgtggggcc atcactccat 
agctggcacc cgccctctgc tgtgggaaca 
ctctcaccgc cctttatctc ggctctctga 
tgaacattgt gccaggattc gggcccacag 
tcaacaagat cgccttcacc ggctccacag 
cccggagcaa tctgaagcgg gtgacgctgg 
gtgcggacgc tgacttggac ttggcagtgg 
aaggccagtg ttgcacggca gcctccaggg 
ttgtcaggcg gagcgtggag tatgccaaga 
aaacagaaca ggggcctcag attgatcaaa 
agagtgggaa gaaggaaggg gccaagctgg 
ggctcttcat caaacccact gtcttctcag 
aggagatttt cgggccagtg caaccaatac 
aaagagcgaa tagcaccgac tatggactca 
aagccctgaa gttggcttct gccttagagt 
ccctctatgc acaggctcca tttggtggct 
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ccatggccac 
tgccgcgccc 
ggcacgaatc 
tatgtgaagt 
ttgccttcca 
tgcaccagct 
tggatacagg 
ccctcagata 
acaacgtcgt 
ggaacttccc 
ccatggtcct 
tcaaagaggc 
tgggagcagc 
aggttggaaa 
agctgggggg 
agtgtgccca 
tgttcgtgga 
aacggcccgt 
agcagttcga 
aatgcggggg 
aagtcacaga 
tgaagttcaa 
cagcagccgt 
ctggaacggt 
ttaaaatgtc 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2629 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



wo 2003/078662 



PCT/US2003/007713 



aggaaatggc 
caccatcaaa 
cggacggcgg 
cccgggacac 
tctcactctc 
gtcctgcctg 
agacagtgag 
gcttgaaatg 
ctgcaagcag 
cacgcctctt 
ttttggtttt 
ttggttagtg 
ccgtgagatt 
agagcttagt 
aatattgttg 
gcaaatccta 
aaattacgca 
ttctccagtt 
taataacaca 
aaataacacc 
ctggcaggtt 
atgttttaac 
gttatccgac 
cttaagtttg 
agtgtgtgta 
attaccacat 
gttaaatcag 
gacactaacc 
ctttaatatg 
gtaagtacac 
tatatatagt 
ctataggcct 
aaataaaatt 



agagaactag 
cttggcgaca 
aatgtggcag 
attcttctgg 
ctgtttattc 
gggagggagc 
atactcaggg 
ctttgccgaa 
ggctcttgca 
tccgaggtga 
tgttttttgt 
cataccgtgg 
cggcttcaaa 
caaagacgat 
aggttacctt 
ggataattca 
tttctatcac 
ctgttaccca 
gccttgttat 
tggattctat 
gtgaattttt 
ttgggaaact 
ggatactttt 
caaattaagt 
taacgaaaac 
ttggcttccc 
gatttggctt 
ttgatagtga 
aaatgctaca 
agaggtggta 
gttaggtttc 
gggaattccg 
ggatatttga 



39740--000lPCT.txt 
gtgaatacgc tttggccgaa tacacagaag tgaaaactgt 1560 
agaacccctg aaggaaaggc ggggctcctt cctcaaacat 1620 
atgaaatgtg ctggaggaaa aaaatgacat ttctgacctt 1680 
aggctttaca tctactggag ttgaatgatt gctgttttcc 1740 
accagactgg ggatgcctat aggttgtctg tgaaatcgca 1800 
tgttggccat ttctgtgttt ccctttaaac cagatcctgg 1860 
cgttgttaac agggagtggt atttgaagtg tccagcagtt 1920 
tctgactcca gtaagaatgt gggaaaaccc cctgtgtgtt 1980 
ccagcggtct cctcagggtg gacctgctta cagagcaagc 2040 
aggtgggacc attccttggg aaaggattca cagtaaggtt 2100 
tttcttgttt ttaaaaaaag gatttcacag tgagaaagtt 2160 
aagggcgcca gggtctttgt ggattgcatg ttgacattga 2220 
ccaatactgc ctttggaata tgacagaatc aatagcccag 2280 
atcacggtct accttaacca aggcactttc ttaagcagaa 2340 
tgctgctaaa gatccaatct tctaacgcca caacagcata 2400 
cctcctcatt tgacaaatca gagctgtaat tcactttaac 2460 
gttcactaac agcttatgat aagtctgtgt agtcttcctt 2520 
atttagatta gtaaagcgta cacaactgga aagactgctg 2580 
ttttaagtcc tattttgata ttaatttctg attagttagt 2640 
ggaggacctc ggtcttcatc caagtggcct gagtatttca 2700 
cttttcctct ttgggaatcc aaatgatgat gtgcaatttc 2760 
gaaagtgttc ccatatagct tcaaaaacaa aaacaaatgt 2820 
atggttacta actagtactt tcctaattgg gaaagtagtg 2880 
tggggagggc aataataaaa tgagggcccg taacagaacc 2940 
catgtataaa atgggcctat cacccttgtc agagatataa 3000 
ttcatcagct aacacttatc acttatacta ccaataactt 3060 
catacactga attttcagta ttttatctca agtagatata 3120 
tacgttagag ggttcctatt cttccattgt acgataatgt 3180 
ttatttataa ttggtagagt tattgtatct ttttatagtt 3240 
tatttaaact tctgtaatat actgtattta gaaatggaaa 3300 
acttctttta aggtttaccc ctgtggtgtg gtttaaaaat 3360 
atcctagctg cagatcgcat cccacaatgc gagaatgata 3420. 
ga 3442 



<210> 348 ; 

<211> 737 

<212> DNA 

<213> Homo sapiens 



<400> 348 

ggagtttcgc 

ttcgaggccg 

aggaggtggt 

ggcagctccc 

gtcacctggg 

acatggtgaa 

actatgaggc 

agaccctgct 

tcgctgacta 

tggatgcgtt 

aggccttcct 

gagggttggg 

aaatttctaa 



cgccgcagtc 
ctgcgcggcc 
gaccgtggag 
caagttccag 
ccgcaccctt 
tgacggcgtg 
gggcaaggat 
gtcccagaac 
caacctgctg 
ccccctgctc 
ggcctcccct 
gggactctga 
gagagct 



ttcgccacca 
ctgcgcatgc 
acgtggcagg 
gacggagacc 
gggctctatg 
gaggacctcc 
gactatgtga 
cagggaggca 
gacttgctgc 
tcagcatatg 
gagtacgtga 
gcgggaggca 



tgccgcccta 
tgctggcaga 
agggctcact 
tcaccctgta 
ggaaggacca 
gctgcaaata 
aggcactgcc 
agaccttcat 
tgatccatga 
tggggcgcct 
acctccccat 
gagtttgcct 



caccgtggtc 
tcagggccag 
caaagcctcc 
ccagtccaat 
gcaggaggca 
catctccctc 
cgggcaactg 
tgtgggagac 
ggtcctagcc 
cagcgcccgg 
caatggcaac 
tcctttctcc 



tatttcccag 60 
agctggaagg 120 
tgcctatacg 180 
accatcctgc 240 
gccctggtgg 300 
atctacacca 360 
aagccttttg 420 
cagatctcct 480 
cctggctgcc 540 
cccaagctca 600 
gggaaacagt 660 
aggaccaata 720 
737 



<210> 349 

<211> 5189 

<212> DNA 

<213> Hoino sapiens 



<400> 349 

atggccaagt 

acctgggtga 

tcaaagaagt 

cttcgtcctg 

gatgaagatg 

tttgatgaaa 

attaaagttt 

attccagtag 



cgggtggctg 
acaatgctgc 
tgtctgttga 
atacatatat 
taggaatgaa 
ttttggttaa 
ctattgatcc 
tagaacacaa 



cggcgcggga 
aaaaaaagaa 
gagagtgtat 
tgggtcagtg 
ttgcagggag 
tgctgctgac 
tgaatctaac 
ggtagagaaa 



gccggcgtgg gcggcggcaa 
gagtcagaaa ctgccaacaa 
cagaagaaga cacaacttga 
gagccattga cgcagttcat 
gttacctttg tgccaggttt 
aataaacaga gggataagaa 
attataagca tttggaataa 
gtttatgttc ctgctttaat 
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cggggcactg 60 
aaatgattct 120 
acacattctt 180 
gtgggtgtat 240 
atacaagatc 300 
catgacttgt 360 
tgggaaaggc 420 
ttttggacag 480 



wo 2003/078662 



PCT/US2003/007713 



cttttaacat ccagtaacta tgatgatgat 
tatggtgcaa aactttgtaa tattttcagt 
gaatacaaac acagttttaa gcagacatgg 
aaaattaaac attttgatgg tgaagattac 
aaatttaaga tggaaaaact .tgacaaggat 
gatttggctg gttcgtgtag aggggtcaag 
aatggatttc gcagttatgt agatctttat 
gccctgaaag ttattcatga gcttgcaaat 
gaaaaaggat tccagcaaat cagctttgta 
cacgtggatt atgtggtaga tcaagttgtt 
aacaaagctg gtgtatcagt gaaaccattt 
aattgcctta ttgaaaatcc aacttttgat 
cccaaaagtt ttgggtctaa atgccagctg 
tgtggcattg tagaaagtat cctgaactgg 
aagaagtgtt catcagtaaa atacagtaaa 
aatgatgctg gtggtaaaca ttccctggag 
gccaaatcac tggctgtgtc tggattaggt 
ccactcaggg gcaaaattct taatgtacgg 
gctgaaataa ataatattat taaaatagtt 
gcagaatctc tgaaaacctt acgctatgga 
gatggttctc acataaaagg cctgcttatt 
ttgaagcatg gttttcttga agagttcatt 
caggaacttt ccttctacag tattcctgaa 
cagaaagcct ggaaaataaa gtactataaa 
aaggaatatt ttgctgatat ggaaaggcat 
gatgatgctg ccattacctt ggcatttagt 
ttaacaaatt ttatggaaga ccggagacag 
ttatatggta ctgcaacaaa gcatttgact 
ctcttctcaa actcagacaa tgaaagatct 
ggccagcgga aagttttatt tacctgtttc 
gcccagttgg ctggctctgt tgctgagatg 
atgatgacta ttgtgaattt ggctcagaac 
cagcctattg gtcagtttgg aactcggctt 
tatattttca caatgttaag cactttagca 
ctccttaagt tcctttatga tgataatcaa 
attcccatgg ttttaataaa tggtgctgag 
cccaactatg atgctaggga aattgtgaac 
cctcatccca tgcttccaaa ctacaaaaac 
aaccagtatg cagtcagtgg tgaaatattt 
gagcttccag ttagaacttg gacacaggta 
aatggaacag ataaaacacc agcattaatt 
actgtgaaat ttgtggtgaa aatgactgaa 
ctgcataaag tttttaaact tcaaactact 
catatgggat gtctgaagaa atatgaaact 
ttacgattaa gttattacgg tttacgtaag 
tctacaaagc ttaacaatca agcccgtttc 
atagagaata ggtcaaagaa agatttgatt 
gacccagtga aagcctggaa agaagcacaa 
aaccagcatg atgatagttc ctccgattca 
attttaaata tgtctctgtg gtctcttact 
agagatgcaa aagggcgaga ggtcaatgat 
aaagaggatt tagcggcatt tgttgaagaa 
gatgttctgg ctggaatgtc tggaaaagca 
aagaaactcc agttggaaga gacaatgccc 
attacagcta tgaaggcaga tgccagcaaa 
gatactgcag cagtaaaagt ggaatttgat 
gcaggagaag aggcat:tgac tccatcagtt 
gagaagaagg agcctggtac cagagtgaga 
gcaaagaaag tgaagaaacg gaatccttgg 
ttggaagaaa cagaacctgt ggttattcca 
gaaagaccta aatacacatt tgatttctca 
gatgatgaca ataatgattt agaggaattg 
ggggaagatg aatttgttcc ttcagatggg 
ggcaaatcaa aagccactcc agaaaaatct 
aatctcttct catttccttc atattctcag 
agtaatgaag aagattctgc ttctgttttt 
aaagttccaa gtaaaacggt agctgctaaa 
aagcccaaga gagccccaaa acagaagaaa 



J9740-000lPCT.txt 

gagaaaaaag ttacaggtgg tcgtaatggt 540 
acaaagttta cagtagaaac agcttgcaaa 600 
atgaataata tgatgaagac ttctgaagcc 660 
acatgcataa cattccaacc agatctgtcc 720 
attgtggccc tcatgactag aagggcatat 780 
gtcatgttta atggaaagaa attgcctgta 840 
gtgaaagaca aattggatga aactggggtg 900 
gaaagatggg atgtttgtct cacattgagt 960 
aatagtattg caactacaaa aggtggacgg 1020 
ggtaaactga ttgaagtagt taagaaaaag 1080 
caagtaaaaa accatatatg ggtttttatt 1140 
tctcagacta aggaaaacat gactctgcag 1200 
tcagaaaaat tttttaaagc agcctctaat 1260 
gtgaaattta aggctcagac tcagctgaat 1320 
atcaaaggta ttcccaaact ggatgatgct 1380 
tgtacactga tattaacaga gggagactct 1440 
gtgattggac gagacagata cggagttttt 1500 
gaagcttctc ataaacagat catggaaaat 1560 
ggtctacaat ataagaaaag ttacgatgat 1620 
aagattatga ttatgaccga tcaggatcaa 1680 
aatttcatcc atcacaattg gccatcactt 1740 
actcctattg taaaggcaag caaaaataag 1800 
tttgacgaat ggaaaaaaca tatagaaaac 1860 
ggattgggta ctagtacagc taaagaagca 1920 
cgcatcttgt ttagatatgc tggtcctgaa 1980 
aagaagaaga ttgatgacag aaaagaatgg 2040 
cgtaggctac atggcttacc agagcaattt 2100 
tataatgatt tcatcaacaa ggaattgatt 2160 
ataccatctc ttgttgatgg ctttaaacct 2220 
aagaggaatg ataaacgtga agtaaaagtt 2280 
tcggcttatc atcatggaga acaagcattg 2340 
tttgtgggaa gtaacaacat taacttgctt 2400 
catggtggca aagatgctgc aagccctcgt 2460 
aggctacttt ttcctgctgt ggatgacaac 2520 
cgtgtagagc ctgagtggta tattcctata 2580 
ggcattggta ctggatgggc ttgtaaacta 2640 
aatgtcagac gaatgctaga tggcctggat 2700 
tttaaaggca cgattcaaga acttggtcaa 2760 
gtagtggaca gaaacacagt agaaattaca 2820 
tataaagaac aggttttaga acctatgcta 2880 
tctgattata aagaatatca tactgacaca 2940 
gagaaactag cacaagcaga agctgctgga 3000 
cttacttgta attccatggt actttttgat 3060 
gtgcaagaca ttctgaaaga attctttgat 3120 
gagtggcttg tgggaatgtt gggagcagaa 3180 
attttagaga agatacaagg gaaaattact 3240 
caaatgttag tccagagagg ttatgaatct 3300 
gaaaaggcag cagaagagga tgaaacacaa 3360 
ggaactcctt caggcccaga ttttaattat 3420 
aaagaaaaag ttgaagaact gattaaacag 3480 
cttaaaagaa aatctccttc agatctttgg 3540 
ctggataaag tggaatctca agaacgagaa 3600 
attaaaggta aagttggcaa acctaaggtg 3660 
tcaccttatg gcagaagaat aattcctgaa 3720 
aagttgctga agaagaagaa gggtgatctt 3780 
gaagaattca gtggagcacc agtagaaggt 3840 
cctataaata aaggtcccaa acctaagagg 3900 
aaaacaccta catcatctgg taaacctagt 3960 
tcagatgatg aatccaagtc agaaagtgat 4020 
agagattctt tgcttaggag agcagcagcc 4080 
gaagaagagg atgatgatgc tgatgatgat 4140 
aaagttaaag catctcccat aacaaatgat 4200 
ttagataaag atgaatatac attttcacca 4260 
ttgcatgaca aaaaaagtca ggattttgga 4320 
aagtcagaag atgattcagc taaatttgac 4380 
tcaccatcat ttggtctgaa acagacagat 4440 
aagggaaaac cgtcttcaga tacagtccct 4500 
gtagtagagg ctgtaaactc tgactcggat 4560 
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tcagaatttg gcattccaaa gaagactaca acaccaaaag gtaaaggccg aggggcaaag 4620 
aaaaggaaag catctggctc tgaaaatgaa ggcgattata accctggcag gaaaacatcc 4680 
aaaacaacaa gcaagaaacc gaagaagaCa tcttttgatc aggattcaga tgtggacatc 4740 
ttcccctcag acttccctac tgagccacct tctctgccac gaaccggtcg ggctaggaaa 4800 
gaagtaaaat attttgcaga gtctgatgaa gaagaagatg atgttgattt tgcaatgttt 4860 
aattaagtgc ccaaagagca caaacatttt tcaacaaata tcttgtgttg tccttttgtc 4920 
ttctctgtct cagacttttg tacatctggc ttattttaat gtgatgatgt aattgacggt 4980 
tttttattat tgtggtaggc cttttaacat tttgttctta cacatacagt tttatgctct 5040 
tttttactca ttgaaatgtc acgtactgtc tgattggctt gtagaattgt tatagactgc 5100 
cgtgcattag cacagatttt aattgtcatg gttacaaaci: acagacctgc tttttgaaat 5160 
gaaatttaaa cattaaaaat ggaactgtg 5189 



<210> 350 

<211> 1536 

<212> DNA 

<213> Homo sapiens 



<400> 350 

gggggggggg 

cccgccgcgc 
tcggagctgc 
ccgccgcacg 
aggaaggacg 
agcctgagag 
gaggagttgc 
gtgaaaatct 
agagaagaag 
tacagagata 
gacaccatca 
gatcttcctc 
agtgagctgt 
aacatcgcca 
ggtgacttta 
aaaattcagc 
gagaaaattg 
actattaaaa 
gtctttaggg 
aactaggtca 
ggcaaatgta 
gtatctgaca 
caaaaacatg 
aagctatttt 
ctgagggagc 
tgttttatat 



ggaccacttg 
cacttcgcct 
cgcgccggcc 
gggagctgca 
accgcacggg 
atgaattccc 
tgtggtttat 
gggatgccaa 
gggacttggg 
tggaatcaga 
aaaccaaccc 
tgatggcgct 
cctgccagct 
gctacgccct 
tacacacttt 
ttcagcgaga 
atgacttcaa 
tggaaatggc 
gttgggctgg 
aaaatctgtc 
actgtgccag 
atgctgaggt 
tatgtgcatt 
tggaatattt 
tgagtaacac 
gttgctataa 



gcctgcctcc 
gcctccgtcc 
cttgcccccc 
gtacctgggg 
caccggcacc 
tctgctgaca 
caagggatcc 
tggatcccga 
cccagtttat 
ttattcagga 
tgacgacaga 
gcctccatgc 
gtaccagaga 
gctcacgtac 
gggagatgca 
acccagacct 
agctgaagac 
tgtttagggt 
atgccgaggt 
cgtgacctat 
ttctttccat 
tatgaacaaa 
tcaatcccac 
ttagaatatt 
catcgatcat 
taaagaagtg 



gtcccgccgc 
cccgcccgcc 
gccgcacagg 
cagatccaac 
ctgtcggtat 
accaaacgtg 
acaaatgcta 
gactttttgg 
ggcttccagt 
cagggagttg 
agaatcatca 
catgccctct 
tcgggagaca 
atgattgcgc 
catatttacc 
ttcccaaagc 
tttcagattg 
gctttcaaag 
aaaagttctt 
cagttattaa 
aataaaaggc 
gtgaggagaa 
gtacttataa 
ttaagaattt 
gatgtagagt 
ttctgc 



gccacttggc 
gcgccatgcc 
s^gcgggacgc 
acatcctccg 
tcggcatgca 
tgttctggaa 
aagagctgtc 
acagcctggg 
ggaggcattt 
accaactgca 
tgtgcgcttg 
gccagttcta 
tgggcctcgg 
acatcacggg 
tgaatcacat 
tcaggattct 
aagggtacaa 
gagcttgaag 
tttgctctaa 
tttttaagga 
tttgagttaa 
tgaaatgtat 
agaaggttgg 
cacaagctat 
gtggttatga 



ctgcctccgt 
tgtggccggc 
cgagccgcgt 
ctgcggcgtc 
ggcgcgctac 
gggtgttttg 
ttccaaggga 
attctccacc 
tggggcagaa 
aagagtgatt 
gaatccaaga 
tgtggtgaac 
tgtgcctttc 
cctgaagcca 
cgagccactg 
tcgaaaagtt 
tccgcatcca 
gatattgtca 
aagaaaaagg 
tgttgccact 
ctcactgagg 
gtgctcttag 
tgaatttcac 
tccctcaaat 
actttatagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1536 



<210> 351 
<211> 2386 
<212> DNA 

<213> Homo sapiens 



<400> 351 

ggaggaggaa 

gggctcccag 

accgcgaggc 

acaaaagatc 

ttctctccct 

acgagactca 

ataaaaatgt 

cagaaaggat 

gaagggactg 

ctggcagaca 

gaggtgatcc 

ctgcggtacc 

gcatatcaga 

gagagccgaa 

ggcgtggttc 



gcaagcgagg 
cctgccgggt 
cggctgagtt 
ttccgggggc 
tcagaatctt 
gtgagtgagc 
ctcagagcaa 
acagctggag 
aatcggagat 
gccccgcggt 
ccatggcagc 
ggcgggcatt 
gctttgaaca 
agggccagga 
tgctgggctc 



gggctggttc 
cgcatgatcc 
accggcatcc 
tgcacctgcc 
atcttggctt 
aggtgttttg 
ccgggagctg 
tcagtttagt 
ggagaccccc 
gaatggagcc 
agtaaagcaa 
cagtgacctg 
ggatactttt 
acgcttcaac 
actcttcagt 



ctgagcttcg caattcctgt 
ctccggccgg agctggtttt 
ccgcagccac ctcctctccc 
tgcctttgcc taaggcggat 
tggatcttag aagagaatca 
gacaatggac tggttgagcc 
gtggttgact ttctctccta 
gatgtggaag agaacaggac 
agtgccatca atggcaaccc 
actggccaca gcagcagttt 
gcgctgaggg aggcaggcga 
acatcccagc tccacatcac 
gtggaactct atgggaacaa 
cgctggttcc tgacgggcat 
cggaaatgac cagacactga 
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gtcgccttct 60 
tttgccagcc 120 
gacctgtgat 180 
ttgaatctct 240 
ctaaccagag 300 
catccctatt 360 
caagctttcc 420 
tgaggcccca 480 
atcctggcac 540 
ggatgcccgg 600 
cgagtttgaa 660 
cccagggaca 720 
tgcagcagcc 780 
gactgtggcc 840 
ccatccactc 900 
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taccctccca 
gagaaccact 
tcccttgcag 
cccttggcct 
tggaagggtt 
tagcctccct 
agaaaggaga 
ccccactctc 
atttaggggc 
gactccctcc 
aagcctctgg 
ccccatgcca 
ctggctccca 
ttatggcctc 
gctcctcatg 
ctatgggagc 
gcatggggtg 
ccccatgcct 
ctcttcaggc 
cagccacccc 
gatatactat 
gcgcgtgcac 
cctggtccag 
acccccaccc 
ttcagtaata 



cccccttctc 
acatgcagcc 
ctagttttct 
cagaattcac 
tgtggctggg 
gaaaatgttt 
ctagattgcc 
cccacactaa 
cacttttgac 
tcacctctgg 
ctagaggcca 
aagctagggt 
tgaccatact 
agctgcctca 
gtgggttcag 
cccagggtct 
gggtagggct 
gtccagcctg 
ctccctctct 
gggctctctg 
ttttgttaag 
gtggagagct 
ggaagctggc 
cacacttgtt 
aactgtgttt 



39740-OOOlPCT. txt 
tgctccacca catcctccgt ccagccgcca 
catgcccacc tgcccatcac agggttgggc 
agaatttatc acacttctgt gagaccccca 
aaaatttcca caaaatctgt ccaaaggagg 
ggcaggaggg ccctacctga ttggtgcaac 
ttctgccagg gagcttgaaa gttttcagaa 
tttgttttga tgtttgtggc ctcagaattg 
cctgggttcc ctttccttcc atccctaccc 
tagggattca ggctgcttgg gataaagatg 
actggctaga gtcctcactc ccagtccaaa 
gccccaccca ggagggaggg ggctatagct 
ggcccttgca gttcagcacc accctagtcc 
gagggaccaa ctgggcccaa gacagatgcc 
cttcctacaa gagcagcctg tggcatcttt 
gggactcagc cctgaggtga aagggagcta 
tccctacctc aggcaggaag ggcaggaagg 
gactagaagg gccagtcctg cctggccagg 
ggcagccagg ctgccaaggc cagagtggcc 
cttctgctcc acccttggcc tgtctcatcc 
ctgtacatat ttgagactag tttttattcc 
cgtgtctgta tttatgtgtg aggagctgct 
ggtgcccgga gattggacgg cctgatgctc 
cgagggtcct ggctcctgag gggcatctgc 
ccagctcttt gaaatagtct gtgtgaaggt 
actcagtgaa aaaaaaaaaa aaaaaa 



ttgccaccag 
ccagatctgg 
cacctcagtt 
ctggcaggta 
ccttacccct 
cctcttcccc 
atcattttcc 
cctaagagcc 
caaggaccag 
tgtcctccag 
acaggaagca 
cttcccctcc 
ccagagctgt 
gccttgggct 
tcaggaacag 
agagcctgct 
cagatctgtg 
tggccaggag 
ccaggggtcc 
ttgtgaagat 
ggcttgcagt 
cctcccctgc 
ccctccccca 
gaaagtgcag 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2386 



<210> 352 

<211> 1270 

<212> DNA 

<213> Horno sapiens 



<400> 352 

agacgttcgc 

gcccgccgcc 

ctcttccaac 

agaccgagac 

gtggtgctgt 

gacacctact 

gttacctcaa 

tctagtagtg 

ccacaaatac 

ccccaaaaca 

ggcaaaaatg 

caaaatttct 

aaatgtcagc 

atgattgaca 

gtgatcctca 

tatgaaggag 

atagcataac 

cataaatgat 

tgatggtttt 

aaggtaaaaa 

ttattttaca 

aaaaaaaaaa 



acacctgggt 

ccgagctccc 

acccgctcgt 

gccgccgctg 

cgctcttgat 

ctgggaagcg 

gaagtgagat 

aaccgtcctc 

ctggctatat 

agacggaaag 

gaaaaaatag, 

gcattcacgg 

aagaatattt 

gtagtttatc 

cagctgttgc 

aagctgagga 

tgaagataaa 

gagtcggtcc 

aaactttcaa 

gtattttttc 

gctcattaaa 



gccagcgccc 
caagccttcg 
tttggcggca 
cgaaggacca 
actcggctca 
tgaaccattt 
gtcttcaggg 
gggagccgac 
tgtcgatgat 
tgaaaatact 
aagaaacaga 
agaatgcaaa 
cggtgaacgg 
aaaaattgca 
tgttattaca 
acgaaagaaa 
attacaggat 
tctttccagt 
ttgtcacttt 
aagttgtaaa 
cttttttaac 



cagaggtccc 
agagcggcgc 
gctcgtgtcc 
atgagagccc 
ggccattatg 
tctggggacc 
agtgagattt 
tatgactact 
tcagtcagag 
tcagataaac 
aagaagaaaa 
tatatagagc 
tgtggggaaa 
ttagcagcca 
gtccagctta 
cttcgacaag 
atcacattgg 
ggatcataag 
ttatgctatt 
taatttattt 
caaacagaaa 



gggacagccc 
acactcccgg 
cagagaccga 
cgctgctacc 
ctgctggatt 
acagtgctga 
cccctgtgag 
cagaagagta 
ttgaacaggt 
ccaaaagaaa 
atccatgtaa 
acctggaagc 
agtccatgaa 
tagctgcctt 
gaagacaata 
agaatggaaa 
agtcactgcc 
acaatggacc 
tctgtatata 
aatatttaat 
aaaaaaaaaa 



gaggcgccgc 
tctccactcg 
gttgccccag 
gccggcgccg 
ggacctcaat 
tggatttgag 
tgaaatgcct 
tgataacgaa 
agttaagccc 
gaaaaaggga 
tgcagaattt 
agtaacatgc 
aactcacagc 
tatgtctgct 
cgtcaggaaa 
tgtacatgct 
aagtcatagc 
ctttttgtta 
aaggtgcacg 
ggaagtgtat 
aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1270 



<210>- 353 

<211> 1600 

<212> DNA 

<213> Homo sapiens 



<400> 353 

gccccgccgc 

gggcgaggcc 

caggcccgga 

tgacttctcc 

cccggagctg 

cttcgtggcc 

ggccatccga 



cggcagtgga 
gggtacctgg 
gcgatggcag 
ggggaaggga 
atccgcatga 
gctgtggtga 
cttcggggca 



ccgctgtgcg 
gctgggatcc 
ccttgatgac 
gccagggact 
agcgagacgg 
atgggagcgc 
tggatctgga 



cgaaccctga accctacggt 
ggagcaagcg ggegagggca 
cccgggaacc ggggccccac 
tcccgaccct tcgccagagc 
aggccgcctg agcgaagcgg 
gcagggcgca cagatcgggg 
ggagacctcg gtgctgaccc 
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cccgacccgc 60 
gcgccctaag 120 
ccgcgcctgg 180 
ccaagcagct 240 
acatcagggg 300 
ccatgctgat 360 
aggccctggc 420 
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tcagtcggga 
ttccacaggg 
tggctgcaag 
taagctggag 
gctggaccag 
aatcctatat 
ctccattctc 
cggaggggcc 
tggcgtggga 
cctgggtcgc 
cgcaggcccg 
cagcggacac 
cggctcggcc 
ggcccgagcc 
ggagcaggag 
gctggcgctg 
cctgggggtg 
ctggctccgc 
ggaggcgctc 
cgttctgccg 

<210> 354 
<211> 1842 
<212> DNA 
<213> Homo 

<400> 354 

cgatcagatc 

ccgactacag 

attaaacatc 

acttggcaag 

ctgtacaacc 

aaggatggta 

attacattga 

ctgtgcctta 

aatgttagag 

gctgttacac 

attggttatc 

tttgttgttt 

gatttgggag 

ttccatctta 

gtgttcattt 

agtacacatc 

attaccatta 

ttctatactt 

tggtttgaaa 

cattgtcttt 

ttgataatat 

ttttttcctc 

attaattttg 

caaatttatg 

tgtggattat 

ttttgtatct 

aaaaatggct 

ataaaatgtg 

ctttatagaa 

gtggtgtctt 

acaattttta 

<210> 355 
<211> 4975 
<212> DNA 
<213> Homo 

<400> 355 
ctctcacaca 
ttgcaatttg 
gtggctcttg 



39740-000lPCT.txt 
cagcagctgg agtggccaga ggcctggcgc cagcagcttg tggacaagca 480 
ggtgtgggtg acaaggtcag cctggtcctc gcacctgccc tggcggcatg 540 
gtgccaatga tcagcggacg tggtctgggg cacacaggag gcaccttgga 600 
tctattcctg gattcaatgt catccagagc ccagagcaga tgcaagtgct 660 
gcgggctgct gtatcgtggg tcagagtgag cagctggttc ctgcggacgg 720 
gcagccagag atgtgacagc caccgtggac agcctgccac tcatcacagc 780 
agtaagaaac tcgtggaggg gctgtccgct ctggtggtgg acgttaagtt 840 
gccgtcttcc ccaaccagga gcaggcccgg gagctggcaa agacgctggt 900 
gccagcctag ggcttcgggt cgcggcagcg ctgaccgcca tggacaagcc 960 
tgcgtgggcc acgccctgga ggtggaggag gcgctgctct gcatggacgg 1020 
ccagacttaa gggacctggt caccacgctc gggggcgccc tgctctggct 1080 
gcggggactc aggctcaggg cgctgcccgg gtggccgcgg cgctggacga 1140 
cttggccgct tcgagcggat gctggcggcg cagggcgtgg atcccggtct 1200 
ctgtgctcgg gaagtcccgc agaacgccgg cagctgctgc ctcgcgcccg 1260 
gagctgctgg cgcccgcaga tggcaccgtg gagctggtcc gggcgctgcc 1320 
gtgctgcacg agctcggggc cgggcgcagc cgcgctgggg agccgctccg 1380 
ggcgcagagc tgctggtcga cgtgggtcag aggctgcgcc gtgggacccc 1440 
gtgcaccggg acggccccgc gctcagcggc ccgcagagcc gcgccctgca 1500 
gtactctccg accgcgcgcc attcgccgcc ccctcgccct tcgcagagct 1560 
ccgcagcaat aaagctcctt tgccgcgaaa 1600 



sapiens 



gatctaagat 
aagaggagaa 
ccctacagaa 
caaacctgcg 
atatccagtt 
ttgagcctat 
acaaacagca 
ttggagaatc 
ctaaaggtga 
atatagggag 
agtcccacgc 
aagaagacac 
ctgaaccaaa 
atgttactca 
tttttttttt 
cccagaatcc 
attacataca 
tacaggaaaa 
gattatgatg 
gctgtactct 
tagtgattct 
atgtaacatt 
tgcattcttt 
catttcattc 
aaaaaatgta 
ttatgctgta 
caagtagaaa 
tacagtgaat 
ggaatttgta 
ccatgtgaac 
ataaattgaa 



ggcgactgtc 
aacggaatct 
cagatgggca 
gctgatctcc 
gtctagtaat 
gtgggaagat 
gagacgaagt 
ttttgatgac 
taagatagca 
ggtatacaag 
agacacagct 
cttctgagta 
gcctcttcaa 
gatataagag 
tttttggcta 
ataaatgtgt 
gattttacct 
aaaattctgt 
catacagttt 
tgctgatggc 
gatttcagtt 
ggtgaaggat 
ggtaattttt 
aaatcagtga 
acatcataat 
ttttaacact 
agcagtccca 
tgtcttttag 
cttcttattg 
agcataagtt 
taggtagtat 



gaaccggaaa 
aatcaggagg 
ctctggtttt 
aagtttgata 
ttaatgcctg 
gagaaaaaca 
gacctcgatc 
tacagtgatg 
atatggacta 
gaaaggttag 
actaagagcg 
ttctcatagg 
aaagcagagt 
aagtctcatt 
gagtttccac 
tcctggccca 
atccacaata 
tgttccattt 
tctagcaatt 
tgctagattt 
tttcatttgt 
ccaggaatat 
tttgtttttt 
tctatgtttg 
tacattccta 
ttgtattact 
ttcatattaa 
acaactagat 
caggcaagtc 
tggagcacta 
catatatatg 



ccacccctac 
ttgctaaccc 
ttaaaaatga 
ctgttgaaga 
gctgtgacta 
aacggggagg 
gcttttggct 
atgtatgtgg 
ctgaatgtga 
gacttcctcc 
gctccaccac 
agactgcgtc 
ggactgcatt 
cgcctttgtc 
tatcccaatc 
ctctgtaata 
gtcagaaaac 
tatgcagaag 
ttctttgttt 
taatttattt 
tttgcttaaa 
gacacaaagg 
gtaactacaa 
tgtgatttcc 
actagaatta 
taggttattt 
gacagtgtac 
ttgtcctttt 
tctatattat 
gtttgattat 
ga 



tcctaatccc 
agaacactat 
taaaagcaaa 
cttttgggct 
ctcacttttt 
acgatggcta 
agagacactt 
cgctgttgtt 
aaacagagaa 
aaagatagtg 
taaaaatagg 
aagcaatcga 
taaatttgat 
ttgtacttct 
aaagaattac 
gttcagtaga 
aacttggcat 
catattttgc 
ctttttacag 
gtttccctac 
tttttttttt 
tggaataaac 
agctttgcta 
taaacataat 
gtatgtctgt 
tgctttggtt 
aaaactgtaa 
atttctccat 
gtcctctttt 
tatgtttatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1842 



sapiens 

cacacacccc tcccctgcca tccctccccg gactccggct ccggctccga 60 
caacctccgc tgccgtcgcc gcagcagcca ccaattcgcc agcggttcag 120 
cctcgatgtc ctagcctagg ggcccccggg ccggacttgg ctgggctccc 180 
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ttcaccctct gcggagtcat gagggcgaac 
agcctggccc ggggctccga ggtgggcaac 
ggcctgagtg tgaccggcga tgctgagaac 
aggtgtgagg tggtgatggg gaaccttgag 
tccttcctgc agtggattcg agaagtgaca 
tctactctac cattgcccaa cctccgcgtg 
tttgccatct tcgtcatgtt gaactataac 
cgcttgactc agctcaccga gattctgtca 
ctttgtcaca tggacacaat tgactggagg 
gtggtgaagg acaatggcag aagctgtccc 
tggggtcctg gatcagaaga ctgccagaca 
aatggtcact gctttgggcc caaccccaac 
tgctcaggcc ctcaggaca.c agactgcttt 
tgtgtacctc gctgtccaca gcctcttgtc 
aatccccaca ccaagtatca gtatggagga 
gtggtggatc aaacatcctg tgtcagggcc 
aatgggctca agatgtgtga gccttgtggg 
ggctctggga gccgcttcca gactgtggac 
accaagatcc tgggcaacct ggactttctg 
aagatccctg ccctggaccc agagaagctc 
ggttacctga acatccagtc ctggccgccc 
ttgacaacca ttggaggcag aagcctctac 
aacttgaatg tcacatctct gggcttccga 
tatataagtg ccaataggca gctctgctac 
cgggggccta cggaagagcg actagacatc 
gcagagggca aagtgtgtga cccactgtgc 
ggtcagtgct tgtcctgtcg aaattatagc 
tttctgaatg gggagcctcg agaatttgcc 
gaatgccaac ccatgggggg cactgccaca 
caatgtgccc attttcgaga tgggccccac 
ggtgccaagg gcccaatcta caagtaccca 
gagaactgca cccaggggtg taaaggacca 
gtgctgatcg gcaaaaccca tctgacaatg 
attttcatga tgctgggcgg cacttttctc 
agggctatga ggcgatactt ggaacggggt 
aaggctaaca aagtcttggc cagaatcttc 
cttggctcgg gtgtctttgg aactgtgcac 
atcaagattc cagtctgcat taaagtcatt 
gctgtgacag atcatatgct ggccattggc 
ctgggactat gcccagggtc atctctgcag 
ctgctggatc atgtgagaca acaccggggg 
ggagtacaaa ttgccaaggg aatgtactac 
ctggctgccc gaaacgtgct actcaagtca 
gtggctgacc tgctgcctcc tgatgataag 
attaagtgga tggcccttga gagtatccac 
tggagctatg gtgtgacagt ttgggagttg 
ctacgattgg ctgaagtacc agacctgcta 
atctgcacaa ttgatgtcta catggtgatg 
cgcccaacct ttaaagaact agccaatgag 
tatctggtca taaagagaga gagtgggcct 
ctgacaaaca agaagctaga ggaagtagag 
ttggaagcag aggaggacaa cctggcaacc 
gttggaacac ttaatcggcc acgtgggagc 
atgcccatga. accagggtaa tcttgggggg 
agtgaacggt gcccccgtcc agtctctcta 
gagtcatcag aggggcatgt aacaggctct 
tgtagaagcc ggagcaggag ccggagccca 
cagcgccaca gtctgctgac tcctgttacc 
gatgtcaacg gttatgtcat gccagataca 
ggcacccttt cttcagtggg tctcagttct 
gaggagtatg aatacatgaa ccggaggaga 
agttcccttg aggagctggg ttatgagtac 
ctgggcagca cacagagttg cccactccac 
actccagatg aagactatga atatatgaat 
gattatgcag ccatgggggc ctgcccagca 
tttcaggggc ctggacatca ggccccccat 
agcttagagg ctacagactc tgcctttgat 
cccaaggcta atgcccagag aacgtaactc 



59740-O00lPCT.txt 

gacgctctgc aggtgctggg cttgcttttc 240 
tctcaggcag tgtgtcctgg gactctgaat 300 
caataccaga cactgtacaa gctctacgag 360 
attgtgctca cgggacacaa tgccgacctc 420 
ggctatgtcc tcgtggccat gaatgaattc 480 
gtgcgaggga cccaggtcta cgatgggaag 540 
accaactcca gccacgctct gcgccagctc 600 
g99gg'tgttt atattgagaa gaacgataag 660 
gacatcgtga gggaccgaga tgctgagata 720 
ccctgtcatg aggtttgcaa ggggcgatgc 780 
ttgaccaaga ccatctgtgc tcctcagtgt 840 
cagtgctgcc atgatgagtg tgccgggggc 900 
gcctgccggc acttcaatga cagtggagcc 960 
tacaacaagc taactttcca gctggaaccc 1020 
gtttgtgtag ccagctgtcc ccataacttt 1080 
tgtcctcctg acaagatgga agtagataaa 1140 
ggactatgtc ccaaagcctg tgagggaaca 1200 
tcgagcaaca ttgatggatt tgtgaactgc 1260 
atcaccggcc tcaatggaga cccctggcac 1320 
aatgtcttcc ggacagtacg ggagatcaca 1380 
cacatgcaca acttcagtgt tttttccaat 1440 
aaccggggct tctcattgtt gatcatgaag 1500 
tccctgaagg aaattagtgc tgggcgtatc 1560 
caccactctt tgaactggac caaggtgctt 1620 
aagcataatc ggccgcgcag agactgcgtg 1680 
tcctctgggg gatgctgggg cccaggccct 1740 
cgaggaggtg tctgtgtgac ccactgcaac 1800 
catgaggccg aatgcttctc ctgccacccg 1860 
tgcaatggct cgggctctga tacttgtgct 1920 
tgtgtgagca gctgccccca tggagtccta 1980 
gatgttcaga atgaatgtcg gccctgccat 2040 
gagcttcaag actgtttagg acaaacactg 2100 
gctttgacag tgatagcagg attggtagtg 2160 
tactggcgtg ggcgccggat tcagaataaa 2220 
gagagcatag agcctctgga ccccagtgag 2280 
aaagagacag agctaaggaa gcttaaagtg 2340 
aaaggagtgt ggatccctga gggtgaatca 2400 
gaggacaaga gtggacggca gagttttcaa 2460 
agcctggacc atgcccacat tgtaaggctg 2520 
cttgtcactc aatatttgcc tctgggttct 2580 
gcactggggc cacagctgct gctcaactgg 2640 
cttgaggaac atggtatggt gcatagaaac 2700 
cccagtcagg ttcaggtggc agattttggt 2760 
cagctgctat acagtgaggc caagactcca 2820 
tttgggaaat acacacacca gagtgatgtc . 2880 
atgaccttcg gggcagagcc ctatgcaggg 2940 
gagaaggggg agcggttggc acagccccag 3000 
gtcaagtgtt ggatgattga tgagaacatt 3060 
ttcaccagga tggcccgaga cccaccacgg 3120 
ggaatagccc ctgggccaga gccccatggt 3180 
ctggagccag aactagacct agacctagac 3240 
accacactgg gctccgccct cagcctacca 3300 
cagagccttt taagtccatc atctggatac 3360 
tcttgccagg agtctgcagt ttctgggagc 3420 
cacccaatgc cacggggatg cctggcatca 3480 
gaggctgagc tccaggagaa agtgtcaatg 3540 
cggccacgcg gagatagcgc ctaccattcc 3600 
ccactctccc cacccgggtt agaggaagag 3660 
cacctcaaag gtactccctc ctcccgggaa 3720 
gtcctgggta ctgaagaaga agatgaagat 3780 
aggcacagtc cacctcatcc ccctaggcca 3840 
atggatgtgg ggtcagacct cagtgcctct 3900 
cctgtaccca tcatgcccac tgcaggcaca 3960 
cggcaacgag atggaggtgg tcctgggggt 4020 
tctgagcaag ggtatgaaga gatgagagct 4080 
gtccattatg cccgcctaaa aactctacgt 4140 
aaccctgatt actggcatag caggcttttc 4200 
ctgctccctg tggcactcag ggagcattta 4260 
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atggcagcta 
ccagcccctt 
ttgacacaaa 
aggaaaggtt 
cactcctgga 
tggaactagg 
aggaaaccta 
agaagcttaa 
agcacttaac 
tttatcatcc 
tcgggcatgg 
acctgaggca 



gtgcctttag 
ttccccagtc 
attcttatgg 
ttccttattt 
gatatgaagg 
ctcttatgtg 
gcagaggaaa 
aatctgtgaa 
tatgagccag 
ttaaaacaat 
tggctcatgc 
aggagtttga 



39740-000lPCT.txt 
agggtaccgt cttctcccta ttccctctct 
ccagacaatt ccattcaatc tttggaggct 
tatgtagcca gctgtgcact ttcttctctt 
tgtgtgcttt cccagtccca ttcctcagct 
attactctcc atatcccttc ctctcaggct 
tgcctttgtt tcccatcaga ctgtcaagaa 
gtgtaatttt ggtttatgac tcttaacccc 
gaaagaggtt aggagtagat attgattact 
gcatcatact aaacttcacc tacattatct 
tctgtgacat acatattatc tcattttaca 
ctgtaatctc agcactttgg gaggctgagg 
gaccagctta gccaacatag taagaccccc 



ctcccaggtc 
tttaaacatt 
tcccaacccc 
tcttcacagg 
cttgactact 
gaggaaaggg 
ctagaaagac 
atcataattc 
cacttagtcc 
caaagggaag 
cagaaggatt 
atctc 



4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4975 



<210> 356 

<211> 4627 

<212> DNA 

<213> Homo sapiens 



<400> 356 

tcacttgcct 

tgacagccgc 

tccgcagccg 

gaggatggag 

tcttctacag 

tgataactgc 

aacaaagtgt 

ggacatgagt 

cttcttttta 

tattattttg 

aaatcgaaaa 

gttgccgcaa 

taatattaat 

cactgtattt 

ttgctaatgt 

aagtaatttc 

agtgcttaga 

ttctgtaagc 

aatcagtgat 

atatattaaa 

acactacaca 

agctataatt 

acaggtacta 

ttctgtaatc 

atattacaac 

cttgaagcct 

tttgctgaat 

aatatatgaa 

tttattcaag 

agttatgtat 

tggtgtattt 

cacaccttcc 

tttttggaag 

aggagtattt 

ggaggctcct 

tcagggagga 

ataataagcc 

catgcattat 

ttactattag 

tatatatatt 

gtcatttggt 

aattagatct 

agaacacatt 

gattcaacca 

attataacaa 

caggttggag 

gatcctcctg 

ctaatttttg 

ctcctaacct 



gatatttcca 
tctccagcca 
ccctccgcca 
atgctctgtg 
gcagtcctca 
acagctttag 
agctctgaca 
caaaactact 
accgtccacc 
tttcttatca 
agtaaagaac 
gtctgaatgg 
attccatttt 
taatgtactt 
ataatgtgca 
ctgagctaaa 
agtaagcaat 
ctatatatat 
aattggtttg 
acaataatgc 
tttgaaacaa 
ctgttatatg 
gtcactcatc 
ttttatttaa 
atttatgtga 
aaatttgtgc 
tctaacatta 
gacaataatt 
gaagttttaa 
ttaaagttgt 
ttccaaatga 
tgtggaggct 
agacaaagat 
ttgggctatt 
tcatcgaang 
tctgttcttc 
atggctgacc 
ataccctggt 
gaaaataaac 
catatacaaa 
gctctgctaa 
aaacagttat 
ttaaatactc 
accaaggacg 
ctatttactt 
tgcagtggca 
cctcagcctc 
tatttttagt 
tgagatccac 



gtgtcagagg gacacagcca acgtggggtc 
ctgccgcgag cccgtctgct cccgccctgc 
agccccagcg cccgctccca tcgecgatga 
ccggcagggt ccctgcgctg ctgctctgcc 
gtacaactgt gattccatca tgtatcccag 
ttcagacaga agacaatcca cgtgtggctc 
tgaatggcta ttgtttgcat ggacagtgca 
gcaggtgtga agtgggttat actggtgtcc 
aacctttaag caaagagtat gtggctttga 
cagtcgtcgg ttccacatat tatttctgca 
caaagaagga atatgagaga gttacctcag 
cgccatcaaa cttatgggca gggataacag 
attaataata tttatgttgg gtcaagtgtt 
gaaaaatgtt .tttatttttg ttttattttt 
gaaaatattt aatatcaaaa gaaaattgat 
tgcttcattg aaagcttcaa agtttatatg 
tcccaggtca tagctcaaga attgttagca 
agtcaaatcg atttagtaag tatgtttttt 
actgtaccat ggtttgatat gtagttggca 
aattagaatt tgggagaagc aaatataggt 
gctaaccctg gggagtctat ggtctcttca 
aggggcagtg gacagttccc tatgccaact 
taccagattc tgcctatgta aaatgaattg 
gtagtgggca tttcatagct tcacaatgtt 
ggtaattatt gctcaacaga caattagaaa 
tttttaagaa tatttttaga ctatttcttt 
aatcacagcc caaaatttga tggactaatt 
ctacatgttg tcttaagatg gaaatacagt 
ctttaataca gctcagtaaa tggcttcttc 
atcttgacac aggaaatggg aaaaaactta 
aaaatctcaa ttgaaagctt ttaaaatgta 
gagatgaaaa ctagggctca ttttcctgac 
ttcttctgca ctctgagccc ataggtctca 
gcataaggag ccactgctgc caccactttt 
ctaaaccttt gagtagagtc tccctggatc 
ctctacgttt atcctggcat gtgctagggt 
tctggagcac caggtgccag gacttgtctc 
gcaatcacac gactgtcatc taaagtcctg 
agacaaaaac aagtaaatat atatggtcct 
catgtatgta tacatgacct taatggatca 
ccatttatat aaaacttaaa aacaagagaa 
ttctgtttcc tatttaatat agctgaagtc 
tacttacagt tggccctctg tggttagttc 
gaaaatgctt aaaaaataat acaacaacaa 
tttttttttt ctttttgaga tggagtctcg 
cgatctcggc tcactgcaac ctcacctccc 
ctgagcagct gggactacag gcgcatgcca 
agaggcgggg tttcaccatg ttggccagga 
cctccacagc ctcccaaact gctgggatta 
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ccttctaggc 
ccgtgcactc 
ccgcggggag 
tgggtttcca 
gagagtccag 
aagtgtcaat 
tctatctggt 
gatgtgaaca 
ccgtgattct 
gatggtacag 
gggatccaga 
tgtgcctggt 
aggtcaataa 
gacagactat 
atttttatac 
cctggtgcac 
aatgacagat 
atgttcctca 
ccatggtatc 
cctgtgttaa 
ctcaggtctc 
cacgactcct 
aaaaacaatt 
ccttttttgt 
aaagtccaca 
ttataggggc 
attattttaa 
tatttcatct 
tagaatgtaa 
aaaattaata 
gaaacttaaa 
atttgtttat 
gagagttaat 
ggattttatg 
acataccagg 
aaacgaaggc 
catgtgtatc 
gccctggccc 
atacatattg 
tagaattgca 
aagaaaaatc 
aaaatatgta 
cacatctgtg 
caaaaaatac 
ctctgttgcc 
gggttcaaga 
ccatgcccag 
tggtctcaat 
caggcgtgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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ccaccgcacg tagcatttac attaggtatt acaagtaatg taaagatgat ttaagtatac 3000 
aggaggatgt gaataggtta tatgcaagca ctatgccctt ttatataagt gacttgaaca 3060 
tctgtgcccg attttagtat gtgcaggggg gcgatctggg aatcagtccc ctgtggatac 3120 
caaggtacaa ctgtatttat taacgcttac tagatgtgag gagagtctga atattttcag 3180 
tgatcttggc tgtttcaaaa aaatctattg acttttcaat aaatcagctg caatccattt 3240 
atttcattta caaaagattt attgtaagcc tctcaatctt ggtttttcag ttgatcttaa 3300 
gcatgtcaat tcataaaaac aagtcatttt tgtatttttc atctttaaga atgcttaaaa 3360 
aagctaatcc ctaaaatagt tagatctttg taaatgcata ttaaataata aagtatgacc 3420 
cacattactt tttatgggtg aaaataagac aaaaataata gttttagtga ggatggtgct 3480 
gagtaaacat aaaaactgat ttgctctcag ctgatgtgtc ctgtacacag tgggaagatt 3540 
ttagttcaca cttagtctaa ctcccccatt ttacagattt ctcactatat atatttctag 3600 
aaggggctat gcatattcaa tgtattgaga accaaagcaa ccacaaatgc ataaatgcat 3660 
aatttatggt cttcaaccaa ggccacataa taacccagtt aacttactct ttaaccagga 3720 
atattaagtt ctataactag tactcaaggt ttaaccttaa aattaagatt tccttaacct 3780 
taaccttaaa attgatatta tattaaacat acataataca atgtaactcc actgttctcc 3840 
tgaatatttt ttgctctaat ctctctgccg aaagtcaaag tgatgggaga attggtatac 3900 
tggtatgact acgtcttaag tcagattttt atttatgagt ctttgagact aaattcaatc 3960 
accaccaggt atcaaatcaa cttttatgca gcaaatatat gattctagtg tctgactttt 4020 
gttaaattca gtaatgcagt ttttaaaaac ctgtatctga cccactttgt aatttttgct 4080 
ccaatatcca ttctgtagac ttttgaaaaa aaagttttta atttgatgcc caatatattc 4140 
tgaccgttaa aaaattcttg ttcatatggg agaaggggga gtaatgactt gtacaaacag 4200 
tatttctggt gtatatttta atgtttttaa aaagagtaat ttcatttaaa tatctgttat 4260 
tcaaatttga tgatgttaaa tgtaatataa tgtattttct ttttattttg cactctgtaa 4320 
ttgcactttt taagtttgaa gagccatttt ggtaaacggt ttttattaaa gatgctatgg 4380 
aacataaagt t:gtattg|cat gcaatttaaa gtaacttatt tgactatgaa tattatcgga 4440 
ttactgaatt gtatcaattt gtttgtgttc aatatcagct ttgataattg tgtaccttaa 4500 
gatattgaag gagaaaatag ataatttaca agatattatt aatttttatt tatttttctt 4560 
gggaattgaa aaaaattgaa ataaataaaa atgcattgaa catcttgcat tcaaaatctt 4620 
cactgac 4627 

<210> 357 

<211> 2634 

<212> DNA 

<213> Homo sapiens 

<400> 357 

ggcacgaggc tgagtgtccg tctcgcgccc ggaagcgggc gaccgccgtc agcccggagg 60 
aggaggagga ggaggaggag gagggggcgg ccatggggct gctgtcccag ggctcgccgc 120 
tgagctggga ggaaaccaag cgccatgccg accacgtgcg gcggcacggg atcctccagt 180 
tcctgcacat ctaccacgcc gtcaaggacc ggcacaagga cgttctcaag tggggcgatg 240 
aggtggaata catgttggta tcttttgatc atgaaaataa aaaagtccgg ttggtcctgt 300 
ctggggagaa agttcttgaa actctgcaag agaaggggga aaggacaaac ccaaaccatc 360 
ctaccctttg gagaccagag tatgggagtt acatgattga agggacacca ggacagccct 420 
acggaggaac aatgtccgag ttcaatacag ttgaggccaa catgcgaaaa cgccggaagg 480 
aggctacttc tatattagaa gaaaatcagg ctctttgcac aataacttca tttcccagat 540 
taggctgtcc tgggttcaca ctgcccgagg tcaaacccaa cccagtggaa ggaggagctt 600 
ccaagtccct cttctttcca gatgaagcaa taaacaagca ccctcgcttc agtaccttaa 660 
caagaaatat ccgacatagg agaggagaaa aggttgtcat caatgtacca atatttaagg 720 
acaagaatac accatctcca tttatagaaa catttactga ggatgatgaa gcttcaaggg 780 
cttctaagcc ggatcatatt tacatggatg ccatgggatt tggaatgggc aattgctgtc 840 
tccaggtgac attccaagcc tgcagtatat ctgaggccag atacctttat gatcagttgg 900 
ctactatctg tccaattgtt atggctttga gtgctgcatc tcccttttac cgaggctatg 960 
tgtcagacat tgattgtcgc tggggagtga tttctgcatc tgtagatgat agaactcggg 1020 
aggagcgagg actggagcca ttgaagaaca ataactatag gatcagtaaa tcccgatatg 1080 
actcaataga cagctattta tctaagtgtg gtgagaaata taatgacatc gacttgacga 1140 
tagataaaga gatctacgaa cagctgttgc aggaaggcat tgatcatctc ctggcccagc 1200 
atgttgctca tctctttatt agagacccac tgacactgtt tgaagagaaa atacacctgg 1260 
atgatgctaa tgagtctgac cattttgaga atattcagtc cacaaattgg cagacaatga 1320 
gatttaagcc ccctcctcca aactcagaca ttggatggag agtagaattt cgacccatgg 1380 
aggtgcaatt aacagacttt gagaactctg cctatgtggt gtttgtggta ctgctcacca 1440 
gagtgatcct ttcctacaaa ttggattttc tcattccact gtcaaaggtt gatgagaaca 1500 
tgaaggtagc acagaaaaga gatgctgtct tgcagggaat gttttatttc aggaaagata 1560 
tttgcaaagg tggcaatgca gtggtggatg gttgtggcaa ggcccagaac agcacggagc 1620 
tcgctgcaga ggagtacacc ctcatgagca tagacaccat catcaatggg aaggaaggtg 1680 
tgtttcctgg actgatccca attctgaact cttaccttga aaacatggaa gtggatgtgg 1740 
acaccagatg tagtatrtctg aactacctaa agctaattaa gaagagagca tctggagaac 1800 
taatgacagt tgccagatgg atgagggagt ttatcgcaaa ccatcctgac tacaagcaag 1860 
acagtgtcat aactgatgaa atgaattata gccttatttt gaagtgtaac caaattgcaa 1920 
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atgaattatg 
gaagtaaaac 
actggctaca 
ctgtactgtt 
tctagagttt 
aataacatat 
caacctactg 
gagttaaaat 
tgaaattttt 
tgtctacatt 
gcaccttatc, 
gctttcccct 

<210> 358 
<211> 1246 
<212> DNA 
<213> Homo 

<400> 358 

gaccagccta 

cgcgcccccc 

aaaggaggca 

gccgggattg 

atcttcgagc 

atcggagccg 

gagtcccagt 

atagctgcgg 

ttttacaagg 

aaagccatcc 

tcagacatct 

gccatcaaag 

gccgtggtca 

aaccgcgaga 

ttggtgggat 

ccactaattt 

tcttttaatg 

ggtttatatt 

aaaggtacta 

tttaaataga 

ttgatacata 

<210> 359 
<211> 2360 
<212> DNA 
<213> Homo 

<400> 359 

gctacgcggg 

gggcgggctg 

ccaggcttgc 
cgcccagctc 
gcctcctctc 
gctgcagttc 
gctgctggaa 
ggaggcggcc 
gtcactttat 
agaaagaaga 
gacttctgca 
tgccacccgg 
ttatatacct 
tgtctgctgg 
gtggaaaatg 
ctcaaggaat 
gaggggaaag 
tgggcctccc 
tccgatggta 
ccctccacca 
aatgccggtt 
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tgaatgccca gagttacttg gatcagcatt taggaaagta aaatatagtg 1980 

tgactcatcc aactagacat tctacagaaa gaaaaatgca ttattgacga 2040 

gtaccatgcc tctcagcccg tgtgtataat atgaagacca aatgatagaa 2100 

ttctgggcca gtgagccaga aattgantaa ggctttcttt ggtaggtaaa 2160 

atacagtgta catgtacata gtaaagtatt tttgattaac aatgtatttt 2220 

ctaaagtcat catgaactgg cttgtacatt tttaaattct tactctggag 2280 

tctaagcagt tttgtaaatg tactggtaat tgtacaatac ttgcattcca 2340 

gtttactgta aatttttgtt cttttaaaga ctacctggga cctgatttat 2400 

ctctttaaaa acattttctc tcgttaattt tcctttgtca tttcctttgt 2460 

aaatcacttg aatccattga aagtgcttca agggtaatct tgggtttcta 2520 

tatgatgttt cttttgcaat tggaataatc acttggtcac cttgccccaa 2580 

ctgaataaat acccattgaa ctctgaaaaa aaaaaaaaaa aaaa 2634 



sapi ens 



cagccgcctg catctgtatc cagcgccagg tcccgccagt cccagctgcg 60 
agtcccgcac ccgttcggcc caggctaagt tagccctcac catgccggtc 120 
ccaagtgcat caaatacctg. ctgttcggat ttaacttcat cttctggctt 180 
ctgtccttgc cattggacta tggctccgat tcgactctca gaccaagagc 240 
aagaaactaa taataataat tccagcttct acacaggagt ctatattctg 300 
gcgccctcat gatgctggtg ggcttcctgg gctgctgcgg ggctgtgcag 360 
gcatgctggg actgttcttc ggcttcctct tggtgatatt cgccattgaa 420 
ccatctgggg atattcccac aaggatgagg tgattaagga agtccaggag 480 
acacctacaa caagctgaaa accaaggatg agccccagcg ggaaacgctg 540 
actatgcgtt gaactgctgt ggtttggctg ggggcgtgga acagtttatc 600 
gccccaagaa ggacgtactc gaaaccttca ccgtgaagtc ctgtcctgat 660 
aggtcttcga caataaattc cacatcatcg gcgcagtggg catcggcatt 720 
tgatatttgg catgatcttc agtatgatct tgtgctgtgc tatccgcagg 780 
tggtctagag tcagcttaca tccctgagca ggaaagttta cccatgaaga 840 
tttttgtttg tttgttttgt tttgtttgtt gtttgttgtt tgtttttttg 900 
tagtattcat tctgcattgc tagataaaag ctgaagttac tttatgtttg 960 
cttcattcaa tattgacatt tgtagttgag cggggggttt ggtttgcttt 1020 
ttttcagttg tttgtttttg cttgttatat taagcagaaa tcctgcaatg 1080 
tatttgctag actctagaca agatattgta cataaaagaa tttttttgtc 1140 
tacaaatgtc tatcaacttt aatcaagttg taacttatat tgaagacaat 1200 
ataaaaaatt atgacaatgt caaaaaaaaa aaaaaa 1246 



sapi ens 



ccacgctgct ggctggcctg acctaggcgc gcggggtcgg gcggccgcgc 60 
agtgagcaag acaagacact caagaagagc gagctgcgcc tgggtcccgg 120 
acgcagaggc gggcggcaga cggtgcccgg cggaatctcc tgagctccgc 180 
tggtgccagc gcccagtggc cgccgcttcg aaagtgactg gtgcctcgcc 240 
ggtgcgggac catgaagctg ctgccgtcgg tggtgctgaa gctctttctg 300 
tctcggcact ggtgactggc gagagcctgg agcggcttcg gagagggcta 360 
ccagcaaccc ggaccctccc actgtatcca cggaccagct gctaccccta 420 
gggaccggaa agtccgtgac ttgcaagagg cagatctgga ccttttgaga 480 
cctccaagcc acaagcactg gccacaccaa acaaggagga gcacgggaaa 540 
aaggcaaggg gctagggaag aagagggacc catgtcttcg gaaatacaag 600 
tccatggaga atgcaaatat gtgaaggagc tccgggctcc ctcctgcatc 660 
gttaccatgg agagaggtgt catgggctga gcctcccagt ggaaaatcgc 720. 
atgaccacac aaccatcctg gccgtggtgg ctgtggtgct gtcatctgtc 780 
tcatcgtggg gcttctcatg tttaggtacc ataggagagg aggttatgat 840 
aagagaaagt gaagttgggc atgactaatt cccactgaga gagacttgtg 900 
cggctgggga ctgctacctc tgagaagaca caaggtgatt tcagactgca 960 
acttccatct agtcacaaag actccttcgt ccccagttgc cgtctaggat 1020 
ataattgctt tgccaaaata ccagagcctt caagtgccaa acagagtatg 1080 
tctgggtaag aagaaagcaa aagcaaggga ccttcatgcc cttctgattc 1140 
aaccccactt cccctcataa gtttgtttaa acacttatct tctggattag 1200 
aaattccata tgctccagga tctttgactg aaaaaaaaaa agaagaagaa 1260 
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gaaggagagc aagaaggaaa gatttgtgaa ctggaagaaa gcaacaaaga ttgagaagcc 1320 
atgtactcaa gtaccaccaa gggatctgcc attgggaccc tccagtgctg gatttgatga 1380 
gttaactgtg aaataccaca agcctgagaa ctgaattttg ggacttctac ccagatggaa 1440 
aaataacaac tatttttgtt gttgttgttt gtaaatgcct cttaaattat atatttattt 1500 
tattctatgt atgttaattt atttagtttt taacaatcta acaataatat ttcaagtgcc 1560 
tagactgtta ctttggcaat ttcctggccc tccactcctc atccccacaa tctggcttag 1620 
tgccacccac ctttgccaca aagctaggat ggttctgtga cccatctgta gtaatttatt 1680 
gtctgtctac atttctgcag atcttccgtg gtcagagtgc cactgcggga gctctgtatg 1740 
gtcaggatgt aggggttaac ttggtcagag ccactctatg agttggactt cagtcttgcc 1800 
taggcgattt tgtctaccat ttgtgttttg aaagcccaag gtgctgatgt caaagtgtaa 1860 
cagatatcag tgtctccccg tgtcctctcc ctgccaagtc tcagaagagg ttgggcttcc 1920 
atgcctgtag ctttcctggt ccctcacccc catggcccca ggccacagcg tgggaactca 1980 
ctttcccttg tgtcaagaca tttctctaac tcctgccatt cttctggtgc tactccatgc 2040 
aggggtcagt gcagcagagg acagtctgga gaaggtatta gcaaagcaaa aggctgagaa 2100 
ggaacaggga acattggagc tgactgttct tggtaactga ttacctgcca attgctaccg 2160 
agaaggttgg aggtggggaa ggctttgtat aatcccaccc acctcaccaa aacgatgaag 2220 
gtatgctgtc atggtccttt ctggaagttt ctggtgccat ttctgaactg ttacaacttg 2280 
tatttccaaa cctggttcat atttatactt tgcaatccaa ataaagataa cccttattcc 2340 
ataaaaaaaa aaaaaaaaaa 2360 

<210> 360 

<211> 1433 

<212> DNA 

<213> Homo sapiens 



<400> 360 

attcggggcg 

cctgcccgcc 

ctgccgagag 

ccgctgctgc 

ctgttccgct 

gcgccgcccg 

gtccgggagc 

ggcgtctaca 

ctgcccctgc 

tatggcgcca 

gtggagaacc 

aagcccctca 

caccggcaga 

cgaccacccc 

tccaccatgc 

cccaactgtg 

cagcgtgggg 

accatccggg 

gtgcacaccc 

gcccctctcc 

ttccagttct 

cccggcctct 

gaggaagggg 

tttatttttg 



agggaggagg 
cgcccgctcg 
tgggctgccc 
tgctgctact 
gcccgccctg 
ccgcggtggc 
cgggctgcgg 
ccccgcgctg 
aggcgctggt 
gcccggagca 
acgtggacag 
agtcgggtat 
tgggcaaggg 
ctgccaggac 
gccttccgga 
acaagcatgg 
agtgctggtg 
gggaccccga 
agcggatgca 
aaacaccggc 
gacacacgta 
ctcttcccag 
gttgtggtcg 
aacccctgtg 



aagaagcgga 
ctcgctcgcc 
cgcgctgccg 
gggcgcgagt 
cacacccgag 
cgcagtggcc 
ctgctgctcg 
cggccagggg 
catgggcgag 
ggttgcagac 
caccatgaac 
gaaggagctg 
tggcaagcat 
tccctgccaa 
tgagcggggc 
cctgtacaac 
tgtgaacccc 
gtgtcatctc 
gtagaccgca 
agaaaacgga 
tttatatttg 
ctgcagatgc 
gggagctggg 
tcccttttgc 



ggaggcggct 
cgccgcgccg 
ctgccgccgc 
ggcggcggcg 
cgcctggccg 
ggaggcgccc 
gtgtgcgccc 
ctgcgctgct 
ggcacttgtg 
aatggcgatg 
atgttgggcg 
gccgtgttcc 
caccttggcc 
caggaactgg 
cctctggagc 
ctcaaacagt 
aacaccggga 
ttctacaatg 
gccagccggt 
gagtgcttgg 
gaaagagacc 
cacacctgct 
gtacaggttt 
ataagattaa 



cccgctcgca 
cgctgccgac 
cgccgctgct 
gcggggcgcg 
cctgcgggcc 
gcatgccatg 
ggctggaggg 
atccccaccc 
agaagcgccg 
accactcaga 
ggggaggcag 
gggagaaggt 
tggaggagcc 
accaggtcct 
acctctactc 
gcaagatgtc 
agctgatcca 
agcagcagga 
gcctggcgcc 
gtggtgggtg 
agcaccgagc 
ccttcttgct 
ggggaggggg 
aggaaggaaa 



gggccgtgca 
cgccagcatg 
gccgctgctg 
cgcggaggtg 
cccgccggtt 
cgcggagctc 
cgaggcgtgc 
gggctccgag 
ggacgccgag 
aggaggcctg. 
tgctggccgg 
cactgagcag 
caagaagctg 
ggagcggatc 
cctgcacatc 
tctgaacggg 
gggagccccc 
ggcttgcggg 
cctgcccccc 
ctggaggatt 
tcggcacctc 
ttccccgggg 
aagagaaatt 
agt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1433 



<210> 361 

<211> 1632 

<212> DNA 

<213> Homo sapiens 



<400> 361 

gccggccgaa 

gcgacctccg 

aggaaagcgc 

gtcgagtagg 

tgaggtgggg 

cgggcaggcc 

tccttgctgc 

aggcacagtg 

gatggaggag 

acagggaagg 

aagaattcag 



cccagacccg 
caaccttgag 
cgccggccag 
tgtgcaccag 
aagccgggga 
agcctccgag 
cttttgcctg 
gaccaagtgg 
agcagtgtgg 
gaaacacagc 
gcaggtgatg 



aggttttaga 
cggcatccgt 
gcccagctgt 
ccctggcaac 
gggcagttga 
ccgggtggac 
ggaattgcct 
aaggcgatgc 
gagaagaaca 
ttcacaatgg 
aatggctttc 



agcagagtca ggcgaagctg 
ggagtgcgcc tgcgcagcta 
ggccggacag ggactggaag 
gagagcgtct accccgaact 
ggaccccgcg gaggcgcgtg 
acaggtttta aaacatgaat 
cagctactct aacatttgat 
acaacagatt atacggcatg 
tgaagatgat tgaactgcac 
ccatgaacgc ctttggagac 
aaaaccgtaa gcccaggaag 
Page 100 



ggccagaacc 60 
cgaccgcagc 120 
agaggacgcg 180 
ctgctggcct 240 
actggttgag 300 
cctacactca 360 
cacagtttag 420 
aatgaagaag 480 
aatcaggaat 540 
atgaccagtg 600 
gggaaagtgt 660 
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tccaggaacc tctgttttat gaggccccca 
tgactcctgt gaagaatcag ggtcagtgtg 
ctcttgaagg acagatgttc cggaaaactg 
tggtagactg ctctgggcct caaggcaatg 
ctttccagta tgttcaggat aatggaggcc 
caacagaaga atcctgtaag tacaatccca 
tggacatccc taagcaggag aaggccctga 
ctgttgctat tgatgcaggt catgagtcct 
agccagactg tagcagtgaa gacatggatc 
aaagcacaga atcagataac aataaatatt 
ggggcatggg tggctacgta aagatggcca 
cagcagccag ctaccccact gtgtgagctg 
ggatggcgca tgcatgggag gaattcatct 
cacactcgaa tcattgaaga tccgagtgtg 
aaatgttacc tctattttaa ttactgctat 
gactttgcat tttcgttttt aaaaggatgt 
atttcaaatg ta 

<210> 362 

<211> 2756 

<212> DNA 

<213> Homo sapiens 

<400> 362 

atgctgtcct tccagtaccc cgacgtgtac 
ggtcataaaa tttgtgaccc ttacgcctgg 
gcctttgtgg aggcccagaa taagattact 
ggtttatiaca aagagagaat gacngaacta 
aagaaaggaa aacggtattt ttatttttac 
tatgtacagg attccttaga gggtgaggcc 
gacgatggca cagtggcact ccgaggttat 
tatggtctga gtgccagtgg ctcagactgg 
gccaaagagc ttccagatgt gcttgaaaga 
gatgggaagg gaatgttcta caactcatac 
gagacatcta ccaatctcca ccaaaagctc 
gaagatattt tgtgtgctga gtttcctgat 
tctgatgatg gccgctatgt cttgttatca 
ctctggtact gtgacctaca gcaggaatcc 
aaactgattg acaactttga aggggaatat 
acattcaaga cgaatcgcca gtctcccaac 
cctgaagagt ctaagtggaa agtacttgtt 
atagcttgtg tcaggtccaa cttcttggtc 
ctgcagctcc atgacctgac tactggtgct 
agcattgtag ggtacagcgg tcagaagaag 
tttttatctc caggtatcat ttatcactgt 
gttttccgag aggtgaccgt aaaaggaatt 
ttctacccta gcaaggatgg tacgaagatt 
aaattggatg gctctcatcc agctttctta 
acacccaact acagtgtttc caggcttatt 
gtggccaaca tcagaggagg tggcgaatat 
gccaacaaac aaaactgctt tgatgacttt 
ggttacacat ctcccaagag gctgactatt 
gctgcttgtg caaatcagag acctgacctc 
atggacatgc tgaagtttca taaatatacc 
tgctcggaca gcaaacaaca ctttgaatgg 
aagttaccag aagcagatga catccagtac 
gatgaccgcg tggtcccgct tcactccctg 
ggccgcagca ggaagcaaag caaccccctg 
ggggcgggga agcccacagc caaagtgata 
gcgcggtgcc tgaacgtcga ctggattcca 
gacagaaaac ctcaagggct ttcccacgtt 
ccccacggga acattattcc tggactgaca 
ttttatcttt tttaggcttc tcctttttag 
tctaggcaca tgtggttttt tggtgttttt 
tggcaacaaa cacattgtga atattagatt 
tatctatatc cataacctct atatctttaa 
acttctttgc aacaataata aatgttattt 
tcatattgag gggcaactcc agaagtggag 



;9740-000lPCT . txt 

gatctgtgga ttggagagag aaaggctacg 720 

gttcttgttg ggcttttagt gctactggtg 780 

ggaggcttat ctcactgagt gagcagaatc 840 

aaggctgcaa tggtggccta atggattatg 900 

tggactctga ggaatcctat ccatatgagg 960 

agtattctgt tgctaatgac accggctttg 1020 

tgaaggcagt tgcaactgtg gggcccattt 1080 

tcctgttcta taaagaaggc atttattttg 1140 

atggtgtgct ggtggttggc tacggatttg 1200 

ggctggtgaa gaacagctgg ggtgaagaat 1260 

aagaccggag aaaccattgt ggaattgcct 1320 

gtggacggtg atgaggaagg acttgactgg 1380 

tcagtctacc agcccccgct gtgtcggata 1440 

atttgaattc tgtgatattt tcacactggt 1500 

aaataggttt atattattga ttcacttact 1560 

ataaattttt acctgtttaa ataaaattta 1620 

1632 



cgcgacgaga ccgccgtaca ggattatcat 60 
cttgaagacc ccgacagtga acagactaag 120 
gtgccatttc ttgagcagtg tcccatcaga 180 
tatgattatc ccaagtatag ttgccacttc 240 
aatacaggtt tgcagaacca gcgagtatta 300 
agagtgttcc tggaccccaa catactgtct 360 
gcgttcagcg aagatggtga atattttgcc 420 
gtgacaatca agttcatgaa agttgatggt 480 . 
gtcaagttca gctgtatggc ctggacccat 540 
cctcaacagg atggaaaaag tgatggcaca 600 
tactaccatg tcttgggaac cgatcagtca 660 
gaacctaaat ggatgggtgg agctgagtta 720 
ataagggaag gatgtgatcc agtaaaccga 780 
agtggcatcg cgggaatcct gaagtgggta 840 
gactacgtga ccaatgaggg ggcggtgttc 900 
tatcgcgtga tcaacattga cttcagggat 960 
cctgagcatg agaaagatgt cttagaatgg 1020 
ttatgctacc tccatgacgt caagaacatt 1080 
ctccttaaga ccttcccgct cgatgtcggc 1140 
gacactgaaa tcttctatca gtttacttcc 1200 
gatcttacca aagaggagct ggagccaaga 1260 
gatgcttctg attaccagac agtccagatt 1320 ^ 
ccaatgttca ttgtgcataa aaaaagcata 1380 
tatggctatg gcggcttcaa catatccatc 1440 
tttgtgagac acatgggtgg tatcctggca 1500 
ggagagacgt ggcataaagg tggtatcttg 1560 
cagtgtgctg ctgagtatct gatcaaggaa 1620 
aatggaggtt caaatggagg cctcttagtg 1680 
tttggttgtg ttattgccca agttggagta 1740 
atcggccatg cttggaccac tgattatggg 1800 
cttgtcaaat actctccatt gcataatgtg 1860 
ccgtccatgc tgctcctcac tgctgaccat 1920 
aagttcattg ccacccttca gtacatcgtg 1980 
cttatccacg tggacaccaa ggcgggccac 2040 
gaggaagtct cagacatgtt tgcgttcatc 2100 
taaacagttt tcgtgcttcc tcctgacagc 2160 
gacaccaaga aaccactggg cataatgctt 2220 
ggctacagtt gaacagaact gccgtgggaa 2280 
caaggccttg gtgtttcttt ttccaccctg 2340 
tttaagggca tgttgggata aatagctaaa 2400 
gctgaattaa ggatcatagt cgggcatact 2460 
ataaatgtga gaactgttct catggagaag 2520 
aagaatgaca gggatttact tccggtttct 2580 
ttttctgtga gaataaagca tttcaccttt 2640 
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ctgcaacaag ttagttttca agcagttaag tcatagaatg tttgttagct gtgaaaataa 2700 
gttgttcatc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag gaattc 2756 

<210> 363 
<211> 2768 
<212> DNA 

<213> Homo sapiens 

<400> 363 

cactgctgtg cagggcagga aagctccatg cacatagccc agcaaagagc aacacagagc 60 
tgaaaggaag actcagagga gagagataag taaggaaagt agtgatggct ctcatcccag 120 
acttggccat ggaaacctgg cttctcctgg ctgtcagcct ggtgctcctc tatctatatg 180 
gaacccattc acatggactt tttaagaagc ttggaattcc agggcccaca cctctgcctt 240 
ttttgggaaa tattttgtcc taccataagg gcttttgtat gtttgacatg gaatgtcata 300 
aaaagtatgg aaaagtgtgg ggcttttatg atggtcaaca gcctgtgctg gctatcacag 360 
atcctgacat gatcaaaaca gtgctagtga aagaatgtta ttctgtcttc acaaaccgga 420 
ggccttttgg tccagtggga tttatgaaaa gtgccatctc tatagctgag gatgaagaat 480 
ggaagagatt acgatcattg ctgtctccaa ccttcaccag tggaaaactc aaggagatgg 540 
tccctatcat tgcccagtat ggagatgtgt tggtgagaaa tctgaggcgg gaagcagaga 600 
caggcaagcc tgtcaccttg aaagacgtct ttggggccta cagcatggat gtgatcacta 660 
gcacatcatt tggagtgaac atcgactctc tcaacaatcc acaagacccc tttgtggaaa 720 
acaccaagaa gcttttaaga tttgattttt tggatccatt ctttctctca ataacagtct 780 
ttccattcct catcccaatt cttgaagtat taaatatctg tgtgtttcca agagaagtta 840 
caaatttttt aagaaaatct gtaaaaagga tgaaagaaag tcgcctcgaa gatacacaaa 900 
agcaccgagt ggatttcctt cagctgatga ttgactctca gaattcaaaa gaaactgagt 960 
cccacaaagc tctgtccgat ctggagctcg tggcccaatc aattatcttt atttttgctg 1020 
gctatgaaac cacgagcagt gttctctcct tcattatgta tgaactggcc actcaccctg 1080 
atgtccagca gaaactgcag gaggaaattg atgcagtttt acccaataag gcaccaccca 1140 
cctatgatac tgtgctacag atggagtatc ttgacatggt ggtgaatgaa acgctcagat 1200. 
tattcccaat tgctatgaga cttgagaggg tctgcaaaaa agatgttgag atcaatggga 1260 
tgttcattcc caaaggggtg gtggtgatga ttccaagcta tgctcttcac cgtgacccaa 1320 
agtactggac agagcctgag aagttcctcc ctgaaagatt cagcaagaag aacaaggaca 1380 
acatagatcc ttacatatac acaccctttg gaagtggacc cagaaactgc attggcatga 1440 
ggtttgctct catgaacatg aaacttgctc taatcagagt ccttcagaac ttctccttca 1500 
aaccttgtaa agaaacacag atccccctga aattaagctt aggaggactt cttcaaccag 1560 
aaaaacccgt tgttctaaag gttgagtcaa gggatggcac cgtaagtgga gcctgaattt 1620 
tcctaaggac ttctgctttg ctcttcaaga aatctgtgcc tgagaacacc agagacctca 1680 
aattactttg tgaatagaac tctgaaatga agatgggctt catccaatgg actgcataaa 1740 
taaccgggga ttctgtacat gcattgagct ctctcattgt ctgtgtagag tgttatactt 1800 
gggaatataa aggaggtgac caaatcagtg tgaggaggta gatttggctc ctctgcttct 1860 
cacgggacta tttccaccac ccccagttag caccattaac tcctcctgag ctctgataag 1920 
agaatcaaca tttctcaata atttcctcca caaattatta atgaaaataa gaattatttt 1980 
gatggctcta acaatgacat ttatatcaca tgttttctct ggagtattct ataagtttta 2040 
tgttaaatca ataaagacca ctttacaaaa gtattatcag atgctttcct gcacattaag 2100 
gagaaatcta tagaactgaa tgagaaccaa caagtaaata tttttggtca ttgtaatcac 2160 
tgttggcgtg gggcctttgt cagaactaga atttgattat taacataggt gaaagttaat 2220 
ccactgtgac tttgcccatt gtttagaaag aatattcata gtttaattat gccttttttg 2280 
atcaggcaca gtggctcacg cctgtaatcc tagcagtttg ggaggctgag ccgggtggat 2340 
cgcctgaggt caggagttca agacaagcct ggcctacatg gttgaaaccc catctctact 2400 
aaaaatacac aaattagcta ggcatggtgg actcgcctgt aatctcacta cacaggaggc 2460 
tgaggcagga gaatcacttg aacctgggag gcggatgttg aagtgagctg agattgcacc 2520 
actgcactcc agtctgggtg agagtgagac tcagtcttaa aaaaatatgc ctttttgaag 2580 
cacgtacatt ttgtaacaaa gaactgaagc tcttattata ttattagttt tgatttaatg 2640 
ttttcagccc atctcctttc atatttctgg gagacagaaa acatgtttcc ctacacctct 2700 
tgcattccat cctcaacacc caactgtctc gatgcaatga acacttaata aaaaacagtc 2760 
gattggtc 2768 

<210> 364 - 
<211> 2984 
<212> DNA 
<213> Homo sapiens 

<400> 364 

gaggaggaac agaaaagaaa agaaaagaaa aagtgggaaa caaataatct aagaatgagg 60 
agaaagcaag aagagtgacc cccttgtggg cactccattg gttttatggc gcctctactt 120 
tctggagttt gtgtaaaaca aaaatattat ggtctttgtg cacatttaca tcaagctcag 180 
cctgggcggc acagccagat gcgagatgcg tctctgctga tctgagtctg cctgcagcat 240 
ggacctgggt cttccctgaa gcatctccag ggctggaggg acgactgcca tgcaccgagg 300 
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gctcatccat 
cctgatctgt 
caagcccacc 
caggtgtcag 
accctggatt 
catcacctgg 
ctcagagagc 
ctcagcccag 
acaggtggca 
cctgaactcc 
gagcccgagt 
gtggtctcta 
actctcagtg 
ctctgatgct 
gctcgctggc 
gagccgctcc 
gtcggccccc 
cctctcggtg 
gtcacaggga 
gcgtctaaga 
gacctcagcc 
gctgactcac 
ctccccgaca 
cgggtcggat 
ggccgtcatc 
tcagggcaaa 
tgtggggcca 
ccaggaagaa 
ggacactcgg 
ctccagacct 
cacaaaggac 
agccccccag 
tga'gcctcct 
ggccatccac 
gggagctgcc 
actctgggaa 
ttttgaaata 
gaaagtgcat 
actacaagcc 
ctttcctata 

tgaaaccctg 
tcccagctat 
agtgagccaa 
attaaaaaaa 



ccacagagca 
ctcgggctga 
ctctgggctg 

gggggccagg 

acacggatcc 
gaacatgcag 
agtgaccccc 
cccagccccg 
tttgatggct 
cagccccatg 
cgcaggtggt 
cccagtgatc 
cagccaggtic 
ggctacaaca 
gcacagcccc 
tacgggggcc 
agcgaccccc 
cagccgggcc 
tggatgcaaa 
tcaacgtacc 
catgcgggga 
cccagtgacc 
acaggcccca 
ccccagagtg 
ctactgctcc 
cactggacat 
gagcccacag 
aacctctatg 
agcccacacg 
aggagagaaa 
agacaggcgg 
gatgtgacct 
ccatcccagg 
tagcccaggg 
cccccagtgg 
cttttagggg 
aagcaacaga 
taaactgaat 
acgaatgaat 
agaaatttag 
ggcgggcaga 
tctctcctaa 
ttgggaggct 
gatcgcacca 
aaaaaaaaaa 



39740-000lPCT.txt 
gggcagtggg aggagacgcc atgaccccca tcctcacggt 360 
gtctgggccc ccggacccac gtgcaggcag ggcacctccc 420 
aaccaggctc tgtgatcacc caggggagtc ctgtgaccct 480 
agacccagga gtaccgtcta tatagagaaa agaaaacagc 540 
cacaggagct tgtgaagaag ggccagttcc ccatcccatc 600 
ggcggtatcg ctgttactat ggtagcgaca ctgcaggccg 660 
tggagctggt ggtgacagga gcctacatca aacccaccct 720 
tggtgaactc aggagggaat gtaaccctcc agtgtgactc 780 
tcattctgtg taaggaagga gaagatgaac acccacaatg 840 
cccgtgggtc gtcccgcgcc atcttctccg tgggccccgt 900 
ggtacaggtg ctatgcttat gactcgaact ctccctatga 960 
tcctggagct cctggtccta ggtgtttcta agaagccatc 1020 
ctatcgtggc ccctgaggag accctgactc tgcagtgtgg 1080 
gatttgttct gtataaggac ggggaacgtg acttccttca 1140 
aggctgggct ctcccaggcc aacttcaccc tgggccctgt 1200 
agtacagatg ctacggtgca cacaacctct cctccgagtg 1260 
tggacatcct gatcgcagga cagttctatg acagagt etc 1320 
ccacggtggc ctcaggagag aacgtgaccc tgctgtgtca 1380 
ctttccttct gaccaaggag ggggcagctg atgacccatg 1440 
aatctcaaaa ataccaggct gaattcccca tgggtcctgt 1500 
cctacaggtg ctacggctca cagagctcca aaccctacct 1560 
ccctggagct cgtggtctca ggaccgtctg ggggccccag 1620 
cctccacatc tggccctgag gaccagcccc tcacccccac 1680 
gtctgggaag gcacctgggg gttgtgatcg gcatcttggt 1740 
tcctcctcct cctcctcttc ctcatcctcc gacatcgacg 1800 
cgacccagag aaaggctgat ttccaacatc ctgcaggggc 1860 
acagaggcct gcagtggagg tccagcccag ctgccgatgc 1920 
ctgccgtgaa gcacacacag cctgaggatg gggtggagat 1980 
atgaagaccc ccaggcagtg acgtatgccg aggtgaaaca 2040 
tggcctctcc tccttcccca ctgtctgggg aattcctgga 2100 
aagaggacag gcagatggac actgaggctg ctgcatctga 2160 
acgcccagct gcacagcttg acccttagac ggaaggcaac 2220 
aagggccctc tccagctgtg cccagcatct acgccactct 2280 
ggggacgcag accccacact ccatggagtc tggaatgcat 2340 
acaccattgg accccaccca gcctggatct accccaggag 2400 
tcactcaatt ctgcagtata aataactaat gtctctacaa 2460 
cttctcaata atcaatgaag tagctgagaa aactaagtca 2520 
cacaatgtaa atattacaca tcaagcgatg aaactggaaa 2580 
gaattaggaa agaaaaaaag taggaaatga atgatcttgg 2640 
ggcagggcac ggtggctcac gcctgtaatt ccagcacttt 2700 
tcacgagttc aggagatcga gaccatcttg gccaacatgg 2760 
aaatacaaaa attagctgga tgtggtggca gtgcctgtaa 2820 
gaggcaggag aatcgcttga accagggagt cagaggtttc 2880 
ctgctctcca gcctggcgac agagggagac tccatctcaa 2940 
agaaagaaaa aaaaaaaaaa aaaa 2984 



<210> 365 

<211> 3061 

<212> DNA 

<213> Homo sapiens 



<400> 365 

cggcacgagg 

tgctggacga 

aggaagacca 

ggctgattgt 

tgaacaatgc 

ccattgatgc 

ttggctccaa 

gtgtggaggg 

actactgtcc 

tggcctttcc 

cagaatatgg 

agaaggcccc 

tggataaagc 

gaaagaaggg 

agcagatgag 

agttcagtaa 



cgactttggt 
tgtggagctg 
gggaatttat 
caatgtgaat 
ctttgaggag 
tacctatgcc 
gcacgtctcc 
cattgtcact 
tgctactaag 
ctccagctct 
cctttctgtc 
agccggccag 
gaagcctggt 
aggctacacc 
caaggatgct 
aacccgatcc 



ggaggtagtt 
cgggaggctc 
cagagcaaag 
gacctgcgca 
ctggttgcct 
aagcagtatg 
ccgcggactc 
aaatgttctc 
aagaccatag 
gtctatccta 
tacaaggatc 
ctcccccgct 
gaccgggttc 
tctgggacct 
cagccctctt 
aaggatatct 



ctttggcagc gggcatggcg 
agagagatta cctggacttc 
ttcgggagct gatcagtgac 
ggaaaaacga gaagagggct 
tccagcgggc cttaaaggat 
aggagttcta cgtaggactg 
ttacctcctg cttcctcagc 
tagttcgtcc caaagtcgtc 
agcgacgtta ttctgatctc 
ccaaggatga ggagaacaat 
accagaccat caccatccag 
ctgtggacgt cattctggat 
aggtggtggg aacctaccgt 
tcaggactgt cctgattgcc 
tctctgctga ggatatagcc 
ttgaccagct ggccaagtca 
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ggtaccgtgg 60 
ctggacgacg 120 
aaccaatacc 180 
aaccggcttc 240 
tttgtggcct 300 
gaaggcagct 360 
tgtgtggtct 420 
cgcagtgtcc 480 
accaccctgg 540 
ccccttgaga 600 
gagatgccgg 660 
gatgacttgg 720 
tgccttcctg 780 
tgtaatgtta 840 
aagatcaaga 900 
ttggccccaa 960 
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39740-OOOlPCT - txt 

gtatccatgg gcatgactat gtcaagaaag caatcctctg cttgctcttg ggaggggtgg 1020 
aacgagacct agaaaatggc agccacatcc gtggggacat caatattctt ctaataggag 1080 
acccatccgt tgccaagtct cagcttctgc ggtatgtgct ttgcactgca ccccgagcta 1140 
tccccaccac tggccggggc tcctctggag tgggtctgac ggctgctgtc accacagacc 1200 
aggaaacagg agagcgccgt ctggaagcag gggccatggt cctggctgac cgaggcgtgg 1260 
tttgcattga tgaatttgac aaaatgtctg acatggatcg cacagccatc catgaagtga 1320 
tggagcaggg tcgagtgacc attgccaagg ctggcatcca tgctcggctg aatgcccgct 1380 
gcagtgtttt ggcagctgcc aaccctgtct acggcaggta tgaccagtat aagactccaa 1440 
tggagaacat tgggctacag gactcactgc tgtcacgatt tgacttgctc ttcatcatgc 1500 
tggatcagat ggatcctgag caggatcggg agatctcaga ccatgtcctt cggatgcacc 1560 
gttacagagc acctggggag caggatggcg atgctatgcc cttgggtagt gctgtggata 1620 
tcctggccac agatgatccc aactttagcc aggaagatca gcaggacacc cagatttatg 1680 
agaagcatga caaccttcta catgggacca agaagaaaaa ggagaagatg gtgagtgcag 1740 
cattcatgaa gaagtacatc catgtggcca aaatcatcaa gcctgtcctg acacaggagt 1800 
cggccaccta cattgcagaa gagtattcac gcctgcgcag ccaggatagc atgagctcag 1860 
acaccgccag gacatctcca gttacagccc gaacactgga aactctgatt cgactggcca 1920 
cagcccatgc gaaggcccgc atgagcaaga ctgtggacct gcaggatgca gaggaagctg 1980 
tggagttggt ccagtatgct tactttaaga aggttctgga gaaggagaag aaacgtaaga 2040 
agcgaagtga ggatgaatca gagacagaag atgaagagga gaaaagccaa gaggaccagg 2100 
agcagaagag gaagagaagg aagactcgcc agccagatgc caaagatggg gattcatacg 2160 
acccctatga cttcagtgac acagaggagg aaatgcctca agtacacact ccaaagacgg 2220 
cagactcaca ggagaccaag gaatcccaga aagtggagtt gagtgaatcc aggttgaagg 2280 
cattcaaggt ggccctcttg gatgtgttcc gggaagctca tgcgcagtca atcggcatga 2340 
atcgcctcac agaatccatc aaccgggaca gcgaagagcc cttctcttca gttgagatcc 2400 
aggctgctct gagcaagatg caggatgaca atcaggtcat ggtgtctgag ggcatcatct 2460 
tcctcatctg aggaggcctc gtctctgaac ttgggttgtg. ccgagagagt ttgttctgtg 2520 
tttcccaccc tctccctgac ccaagtcttt gcctctactc ccttaacagt gttgaattca 2580 
actgaaggcg aggaatgttg gtgatgaagc tgagttcagg actcggtgga ccctttggga 2640 
atgggtcatg aaagctgcca tggggtgagg aaagaggaga cagtgggaga ggacaatgac 2700 
tattgcatct tcattgcaaa agcactggct catccgccct acttcccatc ccacacaaac 2760 
ccaattgtaa ataacatatg acttctgagt acttttgggg gcacaactgt tttctgtttg 2820 
ctgttttttt gttttgtttt ttttctccag agcactttgg tctagactag gctttgggtg 2880 
gttccaattg gtggagagaa gctctgaggc acgtcatgca ggtcaagaaa gctttctttg 2940 
cagtagcacc agttaaggtg aatatgtatt gtatcacaaa acaaacccaa tatccagatg 3000 
aatatccgag atgttgaata aacttagcca tttcgtacaa aaaaaggggg gcccggtaaa 3060 
c 3061 

<210> 366 
.<211> 1360 
<212> DNA 
<213> Homo sapiens 

<400> 366 

cgggggttgc tccgtccgtg ctccgcctcg ccatgacttc ctacagctat cgccagtcgt 60 

cggccacgtc gtccttcgga ggcctgggcg gcggctccgt gcgttttggg ccgggggtcg 120 

cttttcgcgc gcccagcatt cacgggggct ccggcggccg cggcgtatcc gtgtcctccg 180 

cccgctttgt gtccucgtcc tcctcggggg gctacggcgg cggctacggc ggcgtcctga 240 

ccgcgtccga cgggctgctg gcgggcaacg agaagctaac catgcagaac ctcaacgacc 300 

gcctggcctc ctacctggac aaggtgcgcg ccctggaggc ggccaacggc gagctagagg 360 

tgaagatccg cgactggtac cagaagcagg ggcctgggcc ctcccgcgac tacagccact 420 

actacacgac catccaggac ctgcgggaca agattcttgg tgccaccatt gagaactcca 480 

ggattgtcct gcagatcgac aacgcccgtc tggctgcaga tgacttccga accaagtttg 540 

agacggaaca ggctctgcgc atgagcgtgg aggccgacat caacggcctg cgcagggtgc 600 

tggatgagct gaccctggcc aggaccgacc tggagatgca gatcgaaggc ctgaaggaag 660 

agctggccta cctgaagaag aaccatgagg aggaaatcag tacgctgagg ggccaagtgg 720 

gaggccaggt cagtgtggag gtggattccg ctccgggcac cgatctcgcc aagatcctga 780 

gtgacatgcg aagccaatat gaggtcatgg ccgagcagaa ccggaaggat gctgaagcct 840 

ggttcaccag ccggactgaa gaattgaacc gggaggtcgc tggccacacg gagcagctcc 900 

agatgagcag gtccgaggtt actgacctgc ggcgcaccct tcagggtctt gagattgagc 960 

tgcagtcaca gctgagcatg aaagctgcct tggaagacac actggcagaa acggaggcgc 1020 

gctttggagc ccagctggcg catatccagg cgctgatcag cggtattgaa gcccagctgg 1080 

cggatgtgcg agctgatagt gagcggcaga atcaggagta ccagcggctc atggacatca 1140 

agtcgcggct ggagcaggag attgccacct accgcagcct gctcgaggga caggaagatc 1200 

actacaacaa tttgtctgcc tccaaggtcc tctgaggcag caggctctgg ggcttctgct 1260 

. gtcctttgga gggtgtcttc tgggtagagg gatgggaagg aagggaccct tacccccggc 1320 

tcttctcctg acctgccaat aaaaatttat ggtccaaggg 1360 

<210> 367 
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<211> 1412 

<212> DNA 

<213> Homo sapiens 

<400> 367 ^ 
cggggtcgtc cgcaaagcct gagtcctgtc ctttctctct ccccggacag catgagcttc 60 
accactcgct ccaccttctc caccaactac cggtccctgg gctctgtcca ggcgcccagc 120 
tacggcgccc ggccggtcag cagcgcggcc agcgtctatg caggcgctgg gggctctggt 180 
tcccggatct ccgtgtcccg ctccaccagc ttcaggggcg gcatggggtc cgggggcctg 240 
gccaccggga tagccggggg tctggcagga atgggaggca tccagaacga gaaggagacc 300 
atgcaaagcc tgaacgaccg cctggcctct tacctggaca gagtgaggag cctggagacc 360 
gagaaccgga ggctggagag caaaatccgg gagcacttgg agaagaaggg accccaggtc 420 
agagactgga gccattactt caagatcatc gaggacctga gggctcagat cttcgcaaat 480 
actgtggaca atgcccgcat cgttctgcag attgacaatg cccgtcttgc tgctgatgac 540 
tttagagtca agtatgagac agagctggcc atgcgccagt ctgtggagaa cgacatccat 600 
gggctccgca aggtcattga tgacaccaat atcacacgac tgcagctgga gacagagatc 660 
gaggctctca aggaggagct gctcttcatg aagaagaacc acgaagagga agtaaaaggc 720 
ctacaagccc agattgccag ctctgggttg accgtggagg tagatgcccc caaatctcag 780 
gacctcgcca agatcatggc agacatccgg gcccaatatg acgagctggc tcggaagaac 840 
cgagaggagc tagacaagta ctggtctcag cagattgagg agagcaccac agtggtcacc 900 
acacagtctg ctgaggttgg agctgctgag acgacgctca cagagctgag acgtacagtc 960 
cagtccttgg agatcgacct ggactccatg agaaatctga aggccagctt ggagaacagc 1020 
ctgagggagg tggaggcccg ctacgcccta cagatggagc agctcaacgg gatcctgctg 1080 
caccttgagt cagagctggc acagacccgg gcagagggac agcgccaggc ccaggagtat 1140 
gaggccctgc tgaacatcaa ggtcaagctg gaggctgaga tcgccaccta ccgccgcctg 1200 
ctggaagatg gcgaggactt taatcttggt gatgccttgg acagcagcaa ctccatgcaa 1260 
accatccaaa agaccaccac ccgccggata gtggatggca aagtggtgtc tgagaccaat 1320 
gacaccaaag ttctgaggca ttaagccagc agaagcaggg taccctttgg ggagcaggag 1380 
gccaataaaa agttcagagt tcattggatg tc 1412 

<210> 368 
<211> 1075 
<212> DNA 

<213> Homo sapiens 
<400> 368 

cgcagcaaac acatccgtag aaggcagcgc ggccgccgag agccgcagcg ccgctcgccc 60 
gccgcccccc aecccgccgc cccgcccggc gaattgcgcc ccgcgcccct cccctcgcgc 120 
ccccgagaca aagaggagag aaagtttgcg cggccgagcg gggcaggtga ggagggtgag 180 
ccgcgcggga ggggcccgcc tcggccccgg ctcagccccc gcccgcgccc ccagcccgcc 240 
gccgcgagca gcgcccggac cccccagcgg cggcccccgc ccgcccagcc ccccggcccg 300 
ccatgggcgc cgcggcccgc accctgcggc tggcgctcgg cctcctgctg ctggcgacgc 360 
tgcttcgccc ggccgacgcc tgcagctgct ccccggtgca cccgcaacag gcgttttgca 420 
atgcagatgt agtgatcagg gccaaagcgg tcagtgagaa ggaagtggac tctggaaacg 480 
acatttatgg caaccctatc aagaggatcc agtatgagat caagcagata aagatgttca 540 
aagggcctga gaaggatata gagtttatct acacggcccc ctcctcggca gtgtgtgggg 600 
tctcgctgga cgttggagga aagaaggaat atctcattgc aggaaaggcc gagggggacg 660 
gcaagatgca catcaccctc tgtgacttca tcgtgccctg ggacaccctg agcaccaccc 720 
agaagaagag cctgaaccac aggtaccaga tgggctgcga gtgcaagatc acgcgctgcc 780 
ccatgatccc gtgctacatc tcctccccgg acgagtgcct ctggatggac tgggtcacag 840 
agaagaacat caacgggcac caggccaagt tcttcgcctg catcaagaga agtgacggct 900 
cctgtgcgtg gtaccgcggc gcggcgcccc ccaagcagga gtttctcgac atcgaggacc 960 
cataagcagg cctccaacgc ccctgtggcc aactg.caaaa aaagcctcca agggtttcga 1020 
ctggtccagc tctgacatcc cttcctggaa acagcatgaa taaaacactc atccc 1075 

<210> 369 
<211> 1127 
<212> DNA 

<213> Homo sapiens 
<400> 369 

cacgggcggg gcggggcctg ggtccaccgg ggttctgagg ggagactgag gtcctgagcc 60 

gacagcctca gctccctgcc aggccagacc cggcagacag atgagggccc aggaggcctg 120 

gcgggcctgg gggcgctacg gtgggagagg aagccagggg tacctgcctc tgccttccag 180 

ggccaccgtt ggccccagct gtgccttgac tacgtaacat cttgtcctca cagcccagag 240 

catgttccag atcccagagt ttgagccgag tgagcaggaa gactccagct ctgcagagag 300 

gggcctgggc cccagccccg caggggacgg gccctcaggc tccggcaagc atcatcgcca 360 

ggccccaggc ctcctgtggg acgccagtca ccagcaggag cagccaacca gcagcagcca 420 
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tcatggaggc 
ggaggacgac 
gccccccaac 
gtttgtggac 
gatgcggcaa 
caggggaagc 
cccactgccc 
aaaagaggat 
gtgtgaagcc 
cggaagtggc 
agtcgcgagc 
ggccgtgctc 



gctggggctg 
gaagggatgg 
ctctgggcag 
tcctttaaga 
agctccagct 
tccgccccct 
tgggcagcca 
ccgtgctgtc 
acggaaggct 
tccgtggccc 
caggtttaac 
actaccaaat 



tggagatccg 
gggaggagcc 
cacagcgcta 
agggacttcc 
ggacgcgagt 
cccagtgacc 
tcttgaatat 
tcctttggag 
tggtcccatc 
cgccctcagg 
cgttgcgtca 
gttaataaag 



gagtcgccac 
cagccccttt 
tggccgcgag 
tcgcccgaag 
cttccagtcc 
ttcgctccac 
gggcggaagt 
ggagggctga 
ggaagttttg 
ctccgggctt 
ccgggacccg 
cccgcgtctg 



agctcctacc 
cggggccgct 
ctccggagga 
agcgcgggca 
tggtgggatc 
atcccgaaac 
acttccctca 
cccagattcc 
ggttttccgc 
tcccccaggc 
agcccccgcg 
tgccgcc 



ccgcggggac 
cgcgctcggc 
tgagtgacga 
cagcaacgca 
ggaacttggg 
tccacccgtt 
ggcctatgca 
cttccggtgc 
ccacagccgc 
gcctgcgcta 
atgccctggg 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1127 



<210> 370 
<211> 1890 
<212> DNA 

<213> Homo sapiens 



<400> 370 

cttaataaga 

ttatgcttct 

ccactcctct 

ctttcaccaa 

ccatagtggt 

agttttctgg 

tttccagcaa 

attttgggat 

tggaggagtt 

ctccctgcaa 

attttctcac 

tccaggtctg 

tgcttaaaaa 

catcactgga 

aggaaaagga 

atctatttgt 

tgctgaagca 

gacacaggag 

acgagatcca 

atactaagtt 

cataaaacta 

aatttcctaa 

agaaaagtga 

ttgcccgcat 

ctgttgatga 

caccctcata 

atctgctatc 

gacctctcat 

gagagttgtt 

tattaattgc 

taaaacacag 

aaataaaaag 



agagaaggct 
cttttcactc 
tcctattatt 
tttctcaaaa 
gtttcatgga 
aagaggcaat 
tggaaagaga 
ggggaagagg 
gagaaaaacc 
tgtgatctgc 
cctgatgaaa 
caataatttc 
tgttgctctt 
tgttaacaat 
caaccaaaag 
tgctggaaca 
cccagaggtc 
cccctgcatg 
gagatacagt 
cagaaactac 
gggcacaacc 
tccaaatatc 
ctacttcatg 
ggagctattt 
tttaaagaac 
ccagatctgc 
acctgcaact 
caaatcttcc 
caggtcactg 
tgcatatgct 
aaaagtgatt 
tgttattaat 



tcaatggaac 
tggagacaga 
ggaaatatgc 
gtctatggtc 
tatgaggcag 
tccccaatat 
tggaaggaga 
agcattgagg 
aaggcttcac 
tccgttgttt 
agattcaatg 
cctctactca 
acacgaagtt 
cctcgggact 
tcagaattca 
gagacaacaa 
acagctaaag 
caggatagga 
gaccttgtcc 
ctcatcccca 
ataatggcat 
tttgaccctg 
cctttctcag 
ttatttctaa 
ctcaatacta 
ttcatccctg 
ctttttttat 
cattcactca 
cacaaatata 
aatacttttc 
aatgaatgat 
tgctggttca 



cttttgtggt 
gctgnaggag 
tacagataga 
ctgtgttcac 
tgaaggaagc 
ctcaaagaat 
tccggcgttt 
accgtgttca 
cctgtgatcc 
tccagaaacg 
aaaacttcag 
ttgattgttt 
acattaggga 
ttatggattg 
atattgaaaa 
gcaccactct 
tccaggaaga 
gccacatgcc 
ccaccggtgt 
agagctttga 
tactgacttc 
gccactttct 
caggaaaacg 
ccacaatttt 
ctgcagttac 
tctgaagaat 
caaggacatt 
atatcccata 
tctgcaatta 
taatgctgac 
aatttagtcc 



cctggtgctg 
aaggaagctc 
tgttaaggac 
cgtgtatttt 
cctgattgat 
tactaaagga 
ctccctcaca 
agaggaagct 
cactttcatc 
atttgattat 
gattctgaac 
cccaggaact 
gaaagtaaaa 
cttcctgatc 
cttggttggc 
gagatatgga 
gattgatcat 
ttacactgat 
gccccatgca 
taacaagata 
cgtgctacat 
agataagaat 
aatttgtgca 
acagaacttt 
caaagggatt 
gctagcccat 
cccactatta 
agcatccaaa 
ttcatactct 
tttttaatat 
atttcttttg 



tgtctctctt 
cctcctggcc 
atctgcaaat 
ggcatgaatc 
aatggagagg 
cttggaatca 
aacttgcgga 
cactgccttg 
ctgggctgtg 
aaagatcaga 
tccccatgga 
cacaacaaag 
gaacaccaag 
aaaatggagc 
actgtagctg 
ctcctgctcc 
gtaattggca 
gctgtagtgc 
gtgaccactg 
atgctggctg 
gatgacaaag 
ggcaacttta 
ggagaaggac 
aacctgaaat 
gtttctctgc 
ctggctgctg 
tgtcttctct 
ctccattaag 
gtaacacttg 
gttatcactg 
tgaatgtgct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1890 



<210> 371 

<211> 4946 

<212> DNA 

<213> Homo sapiens 



<400> 371 

agtcagccct 

actgcgggac 

cagggctctc 

cggtcggctc 

ccgggcaatg 

gaatgacatc 

tgaggttcac 

ggagccgaag 

caacggaccg 

cccgcgaaaa 



gctgccagcc 
agtgagccga 
aggagccggg 
acactcgagc 
gtggtggctc 
aagggaggag 
tggccagcgg 
agcaggcgga 
gaagtggggg 
ctctgagctg 



agtgccgggt 
gcagaagctg 
tcctgggcaa 
agtaaaaaga 
ggcctagaga 
acgggcggga 
agccggacac 
ccccctccgc 
gcggaagtgc 
gctgacagct 



gctggggact cagggaggcc 
gaacgcagga gaggaaggag 
ggcgcagecg ttttcaaatt 
tgcctctggg gaggaggccc 
ggcggtagtg gaacgcagac 
ccccagattt ctgcctgtgg 
agaacgcgca aaacgccgtg 
gggggaacag tttccgccgg 
agtgggctca gcgccgactg 
ggggacgggt ggcggccctc 
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cgccgggacc 60 
agggggcggt 120 
ttcaggaaag 180 
gtgcagctct 240 
cctggtgggg 300 
gcgatggaag 360 
taggcctgga 420 
gagcacaaag 480 
cgcgcctctg 540 
gactggagtc 600 



wo 2003/078662 



PCT/US2003/007713 



39740-OOOlPCT-txt 

ggttgagttc ctgagggacc ccggttctgg aaggttcgcc gcggagacaa gtgagcagtc 660 
tgtgccatag ggattctcga agagaacagc gttgtgtccc agtgcacatg ctcgcatcgc 720 
ttaccaggag tgcccgagac cctaagatgt tcggagtggt tttttcgcac agacccgaat 780 
agcctgcccc tcagccacgc tctgtgccct tctgagaaca ggctgatatg cccaagatag 840 
tcctgaatgg tgtgaccgta gacntccctt tccagcccta caaatgccaa caggagtaca 900 
tgaccaaggt cctggaatgt ctgcagcaga aggtgaatgg catcctggag agccctacgg 960 
gtacagggaa gacgctgtgc ctgctgtgca ccacgctggc ctggcgagaa cacctccgag 1020 
aciggcatctc tgcccgcaag attgccgaga gggcgcaagg agagcttttc ccggatcggg 1080 
ccttgtcatc ctggggcaac gctgctgctg ctgctggaga ccccatagct tgctacacgg 1140 
acatcccaaa gattatttac gcctccagga cccactcgca actcacacag gtcatcaacg 1200 
agcttcggaa cacctcctac cggcctaagg tgtgtgtgct gggctcccgg gagcagctgt 1260 
gcatccatcc tgaggtgaag aaacaagaga gtaaccatct acagatccac ttgtgccgta 1320 
agaaggtggc aagtcgctcc tgtcatttct acaacaacgt agaagaaaaa agcctggagc 1380 
aggagctggc cagccccatc ctggacattg aggacttggt caagagcgga agcaagcaca 1440 
gggtgtgccc ttactacctg tcccggaacc tgaagcagca agccgacatc atattcatgc 1500 
cgtacaatta cttgttggat gccaagagcc gcagagcaca caacattgac ctgaagggga 1560 
cagtcgtgat ctttgacgaa gctcacaacg tggagaagat gtgtgaagaa tcggcatcct 1620 
ttgacctgac tccccatgac ctggcttcag gactggacgt catagaccag gtgctggagg 1680 
agcagaccaa ggcagcgcag cagggtgagc cccacccgga gttcagcgcg gactccccca 1740 
gcccagggct gaacatggag ctggaagaca ttgcaaagct gaagatgatc ctgctgcgcc 1800 
tggagggggc catcgatgct gttgagctgc ctggagacga cagcggtgtc accaagccag 1860 
ggagctacat ctttgagctg tttgctgaag cccagatcac gtttcagacc aagggctgca 1920 
tcctggactc gctggaccag atcatccagc acctggcagg acgtgctgga gtgttcacca 1980 
acacggccgg actgcagaag ctggcggaca ttatccagat tgtgttcagt gtggacccct 2040 
ccgagggcag ccctggttcc ccagcagggc tgggggcctt acagtcctat aaggtgcaca 2100 
tccatcctga tgctggtcac cggaggacgg ctcagcggtc tgatgcctgg agcaccactg 2160 
cagccagaaa gcgagggaag gtgctgagct actggtgctt cagtcccggc cacagcatgc 2220 
acgagctggt ccgccagggc gtccgctccc tcatccttac cagcggcacg ctggccccgg 2280 
tgtcctcctt tgctctggag atgcagatcc ctttcccagt ctgcctggag aacccacaca 2340 
tcatcgacaa gcaccagatc tgggtggggg tcgtccccag aggccccgat ggagcccagt 2400 
tgagctccgc gtttgacaga cggttttccg aggagtgctt atcctccctg gggaaggctc 2460 
tgggcaacat cgcccgcgtg gtgccctatg ggctcctgat cttcttccct tcctatcctg 2520 
tcatggagaa gagcctggag ttctggcggg cccgcgactt ggccaggaag atggaggcgc 2580 
tgaagccgct gtttgtggag cccaggagca aaggcagctt ctccgagacc atcagtgctt 2640 
actatgcaag ggttgccgcc cctgggtcca ccggcgccac cttcctggcg gtctgccggg 2700 
gcaaggccag cgaggggctg gacttctcag acacgaatgg ccgtggtgtg attgtcacgg 2760 
gcctcccgta ccccccacgc atggaccccc gggttgtcct caagatgcag ttcctggatg 2820 
agatgaaggg ccagggtggg gctgggggcc agttcctctc tgggcaggag tggtaccggc 2880 
agcaggcgtc cagggctgtg aaccaggcca tcgggcgagt gatccggcac cgccaggact 2940 
acggagctgt cttcctctgt gaccacaggt tcgcctttgc cgacgcaaga gcccaactgc 3000 
cctcctgggt gcgtccccac gtcagggtgt atgacaactt tggccatgtc atccgagacg 3060 
tggcccagtt cttccgtgtt gccgagcgaa ctatgccagc gccggccccc cgggctacag 3120 
cacccagtgt gcgtggagaa gatgctgtca gcgaggccaa gtcgcctggc cccttcttct 3180 
ccaccaggaa agctaagagt ctggacctgc atgtccccag cctgaagcag aggtcctcag 3240 
ggtcaccagc tgccggggac cccgagagta gcctgtgtgt ggagtatgag caggagccag 3300 
ttcctgcccg gcagaggccc agggggctgc tggccgccct ggagcacagc gaacagcggg 3360 
cggggagccc tggcgaggag caggcccaca gctgctccac cctgtccctc ctgtctgaga 3420 
agaggccggc agaagaaccg cgaggaggga ggaagaagat ccggctggtc agccacccgg 3480 
aggagcccgt ggctggtgca cagacggaca gggccaagct cttcatggtg gccgtgaagc 3540 
aggagttgag ccaagccaac tttgccacct tcacccaggc cctgcaggac tacaagggtt 3600 
ccgatgaott cgccgccctg gccgcctgtc tcggccccct ctttgctgag gaccccaaga 3660 
agcacaacct gctccaaggc ttctaccagt ttgtgcggcc ccaccataag cagcagtttg 3720 
aggaggtctg tatccagctg acaggacgag gctgtggcta tcggcctgag cacagcattc 3780 
cccgaaggca gcgggcacag ccggtcctgg accccactgg aagaacggcg ccggatccca 3840 
agctgaccgt gtccacggct gcagcccagc agctggaccc ccaagagcac ctgaaccagg 3900 
gcaggcccca cctgtcgccc aggccacccc caacaggaga ccctggcagc caaccacagt 3960 
gggggtctgg agtgcccaga gcagggaagc agggccagca cgccgtgagc gcctacctgg 4020 
ctgatgcccg cagggccctg gggtccgcgg gctgtagcca actcttggca gcgctgacag 4080 
cctataagca agacgacgac ctcgacaagg tgctggctgt gttggccgcc ctgaccactg 4140 
caaagccaga ggacttcccc ctgctgcaca ggttcagcat gtttgtgcgt ccacaccaca 4200 
agcagcgctt ctcacagacg tgcacagacc tgaccggccg gccctacccg ggcatggagc 4260 
caccgggacc ccaggaggag aggcttgccg tgcctcctgt gcttacccac agggctcccc 4320 
aaccaggccc ctcacggtcc gagaagaccg ggaagaccca gagcaagatc tcgtccttcc 4380 
ttagacagag gccagcaggg actgtggggg cgggcggtga ggatgcaggt cccagccagt 4440 
cctcaggacc tccccacggg cctgcagcat ctgagtgggg cctctaggat gtgcccagcc 4500 
tgccacaccg cctccaggaa gcagagcgtc atgcaggtct tctggccaga gccccagtga 4560 
gtgcccacgg aggcccccag cacacccaac gtggcttgat cacctgcctg tccagctctg 4620 
gtgggccaag aacccaccca acagaatagg ccagcccatg ccagccggct tggcccgctg 4680 
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